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CRYSTAL STRUCTURE OF A DEACETYLASE AND INHIBITORS THEREOF 



This application claims priority of U.S. Provisional 
Application No. 60/152,753, filed September 8, 1999, the 
contents of which are hereby incorporated by reference. 

This invention has been made with government support under 
National Institutes of Health Grant No. ROl CA-65698. 
Accordingly, the U.S. Government may have certain rights in 
the invention. 



Throughout this application, various publications are 
referenced by author, date and citation. The disclosures of 
these publications in their entireties are hereby 
incorporated by reference into this application in order to 
more fully describe the state of the art as known to those 
skilled therein as of the date of the invention described 
and claimed herein. 



Introduction 

The present invention relates to a histone deacetylase 
homologue from the hyperthermophilic bacterium Aguifex 
aeolicus, HDLP (histone deacetylase like protein; also known 
as AcuCl), which shares 35.2 % sequence identity with human 
histone deacetylase (HDAC1) , that can be co-crystallized 
with an inhibitory ligand, and more particularly, to the 
detailed crystallographic data obtained from said co- 
crystallization -which is disclosed herein. The invention 
also relates to methods of using the crystal structure and 
x-ray crystallographic coordinates of the apo-HDLP and 
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inhibitor-bound HDLP to design, isolate and screen compounds 
which bind to and inhibit the active site of HDLP and HDLP- 
related proteins, such as those proteins belonging to the 
HDAC family, including HDAC1 . 

5 

Background of the Invention 

The reversible modification of histones by acetylation is 
associated with changes in nucleosome conformation and 
chromatin structure, and plays an important role in the 

10 regulation of gene expression (reviewed in Davie and Chadee, 
1998, J. Cell Biochem. Suppl . 30-31 : 203-213) . The histone 
acetylase and deacetylase enzymes that carry out these 
modifications are involved in many cellular processes such 
as cell cycle progression and differentiation, and their 

15 deregulation is associated with several types of human 
cancer (reviewed in Kouzarides, 1999, Curr. Opin. Genet. 
Dev. 2:40-48; Hassig et al . , 1997, Chew. Biol. 4:783-789; 
Fenrick and Heibert, 1998, J. Cell. Biochem. Suppl. 30- 
31 :194-202) . 

20 

Recently, several experimental antitumor compounds, such as 
trichostatin A (TSA) , trapoxin, suberoylanilide hydroxamic 
acid (SAHA) , and phenylbutyrate have been shown to act, at 
least in part, by inhibiting histone deacetylases . Richon et 

25 al., 1998, Proc . Natl. Acad. Sci . , USA 95 : 3003 -3007 ; Yoshida 
et al., 1990, J. Biol. Chem. 265:17174-17179; Kijima et al . , 
1993, J. Biol. Chem. 2££: 22429-22435 . Additionally, diallyl 
sulfide and related molecules (Lea et al . , 1999, Int. J. 
Oncol. 2:347-352), oxamf latin (Kim et al . , 1999, Oncogene 

30 15.: 2461-2470) , MS-27-275, a synthetic benzamide derivative 
(Saito et al . , 1999, Proc. Natl. Acad. Sci. 96:4592-4597), 
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butarate derivatives (Lea and Tulsyan, 1995, Anticancer Res. 
15:879-883), FR901228 (Noka j ima et al . , 1998, Exp. Cell Res. 
24^:126-133), depudecin (Kwon et al . , 1998, Proc . Natl. 
Acad. Sci. USA £5:3356-3361) and m- carboxysinnamic acid 
bishydroxamide (CBHA; Richon et al . , Proc. Natl. Acad. Sci. 
USA 95:3003-3007) have been shown to inhibit histone 
deacetylases . In vitro, these compounds can inhibit the 
growth of fibroblast cells by causing cell cycle arrest in 
the Gl and G2 phases (Richon et al . , 1996, Proc. Natl. Acad. 
Sci. USA 93:5705-5708; Kim et al . , 1999, Oncogene 
lJi:2461-2410; Yoshida et al . , 1995, Bioessays 17:423-430; 
Yoshida & Beppu, 1988, Exp. Cell. Res. 177:122-131), and can 
lead to the terminal differentiation and loss of 
transforming potential of a variety of transformed cell 
lines. Richon et al . , 1996, Proc. Natl. Acad. Sci. USA 
93.: 5705-5708 ; Kim et al . , 1999, Oncogene 18:2461-2470; 
Yoshida et al . , 1987, Cancer Res. 42:3688-3691. In vivo, 
phenylbutyrate is effective in the treatment of acute 
promyelocytic leukemia in conjunction with retinoic acid. 
Warrell et al . , 1998, J. Natl. Cancer Inst. 90:1621-1625. 
SAHA is effective in preventing the formation of mammary 
tumors in rats, and lung tumors in mice. Desai et al . , 
1999, Proc. AACR 40: abstract #2396; Cohen et al . , Cancer 
Res. , submitted. 

Histone deacetylases catalyze the removal of acetyl groups 
from the e -amino groups of lysine residues clustered near 
the N-terminus of nucleosomal histones, and this process is 
associated with transcriptional repression (reviewed in 
Struhl, 1998, Genes Dev. 12:599-606). Deletion of the yeast 
histone deacetylase gene, rpd3 , or its pharmacological 
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inactivation with trichostatin A reduces the transcriptional 
repression in a subset of promoters, such as those of 
Ume6-regulated genes. Kadosh & Struhl, 1998, Mol . Cell. 
Biol. 18.:5121-5127 . This is accompanied by the increased 
5 acetylation of H4 histones in the repressed promoter and its 
vicinity, but has no effect on histones at promoter distal 
regions. Kadosh & Struhl, 1998, Mol. Cell. Biol. 18:5121- 
5127; Rundlett et al . , 1998, Nature 392 : 831-835 . 

10 Histone deacetylases are recruited to specific promoters by 
associating with DNA-binding transcriptional repressors, 
either directly or through co-repressors that bridge the 
deacetylase to the transcriptional repressors. For example, 
the Mad and Ume6 repressors bind to the co-repressor Sin3A 

15 (Laherty et al . , 1997, Cell M:349-356; Hassig et al . , 1997, 

Cell 89:341-347; Kadosh & Struhl, 1997, Cell 89:365-371), 
and the nuclear receptors bind N-CoR and the related SMRT 
co-repressors. Nagy et al . , 1997, Cell 89.: 373-380; Alland 
et al, 1997, Nature 387:49-55; Heinzel et al , 1997, Nature 

20 387:43-48. 

The deregulation of histone deacetylase recruitment appears 
to be one of the mechanisms through which these enzymes 
contribute to tumorigenesis . In acute promyelocytic 

25 leukemia (APL) , chromosomal translocations fuse. the retinoic 
acid receptor-a (RARa) to either PLZF or to PML. These 
fusion oncoproteins have aberrant transcriptional repression 
activity resulting, in part, through the recruitment of a 
co-repressor and, in turn, HDACs . Grignani et al, 1998, 

30 Nature 321:815-818; Lin et al . , 1998, Nature 311:811-814. 

Treatment of PLZF- RARa APL cells with TSA enhances their 
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responsiveness to retinoic acid-induced differentiation. 
Grignani et al , 1998, Nature 191:815-818; Lin et al . , 1998, 
Nature 391 :811-814. 



The histone deacetylases comprise a large family of 
proteins, conserved from yeast to man, and are divided into 
two related classes. Class I is characterized by human 
HDAC1, 2, 3 (Taunton et al . , 1996, Science 272 :408-411 ; Yang 
et al., 1996, Proc. Natl. Acad. Sci . USA 93:12845-12850; 
Emiliani et al . , 1998, Proc. Natl. Acad. Sci. USA 95:2795- 
2800), and yeast RPD3 (Videl & Gaber, 1991, Mol . Cell. Biol. 
11:6317-6327), and class II by the human HDAC4 , 5, 6 
(Grozinger et al . , 1999, Proc. Natl. Acad. Sci. USA 9£:4868- 
4873 ; Fischle, et al . , 1999, J. Biol. Chem. 274:11713- 
11720), and yeast HDA1 (Rundlett et al . , 1996, Proc. Natl. 
Acad. Sci. USA 93:14503-14508). The two classes share a 
-3 90 amino acid region of sequence similarity, comprising 
the deacetylase core, but are divergent outside this region. 
The histone deacetylase genes belong to an even larger 
superfamily (Leipe & Landsman, 1997, Nucleic Acids Res. 
25:3693-3697) that contains the prokaryotic acetoin 
utilization proteins (AcuC; 28.1% sequence identity to 
HDAC1), and the prokaryotic acetylpolyamine amidohydrolases 
(APAH; 15.0 % sequence identity to HDAC1) . The enzymatic 
activity of AcuC is not clear, but its disruption reduces 
the ability of B. subtilis to breakdown acetoin and utilize 
it as a carbon source. Grundy et al . , 1993, Mol. Microbiol. 
10:259-271. APAHs catalyze the deacetylation of polyamines 
by cleaving a non-peptide amide bond (reviewed in Leipe & 
Landsman, 1997, Nucleic Acids Res . 25:3693-3697). 
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It is useful to address the questions of how HDACs and HDAC- 
related proteins catalyze the deacetylation of histones and 
how the above -referenced compounds, particularly those 
compounds with antitumor activity, inhibit this activity in 
5 order to better understand the mechanism of inhibition of 
HDACs and to facilitate discovery of additional useful 
compounds which may inhibit this activity. To this end, the 
present invention has determined the three dimensional 
structure of a HDACl-like protein from the thermophilic 

10 bacterium Aquifex aeolicus, herein after HDLP . The 
determination of the nucleic acid coding sequence of HDLP 
was described by Deckert et al . , 1998, Nature 392 : 353-358 . 
The encoded 375 residue protein, whose sequence was 
determined from the nucleic acid encoding sequence, shares 

15 35.2% amino acid sequence identity with HDAC1, deacetylates 
histones in vitro, and is inhibited by TSA, SAHA and several 
other HDAC inhibitors. The determination of the three- 
dimensional structure of HDLP is useful in the design, 
identification and screening of new HDAC family inhibitory 

20 compounds which are useful for the inhibition of cell growth 
both in vivo and in vitro. 



Summary of the Invention 

In general, it is the object of the present invention to 

2 5 provide detailed three-dimensional structural information 

for a family of proteins known as histone deacetylases 
(HDAC) , and particularly a homologue from the 
hyperthermophilic bacterium Aquifex aeolicus HDLP (histone 
deacetylase-like protein) which shares 35.2 % sequence 

3 0 identity with human histone deacetylase (HDAC1) . It is also 

an object of the present invention to provide three- 
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dimensional structural information of an HDLP bound to an 
inhibitory compound. 



In one embodiment of the invention, three-dimensional 
structure information is obtained from a crystal of wild- 
type HDLP (SEQ ID NO : 1 ) (the nucleic acid encoding wild- type 
HDLP is SEQ ID N0:2). In a further embodiment of the 
invention, three-dimensional information is obtained from a 
mutant HDLP comprising two mutations (l) cysteine 75 to a 
serine and (2) cysteine 77 to a serine (Cys75Ser/Cys77Ser 
double mutant; SEQ ID NO: 3) (the nucleic acid encoding HDLP 
Cys75Ser/Cys77Ser double mutant is SEQ ID NO: 4) . The HDLP 
mutant of the present invention facilitates the 
determination of three-dimensional structural information of 
HDLP bound to a zinc atom at its zinc atom-binding site. 

In a preferred embodiment of the invention, the three- 
dimensional structural information is obtained from a co- 
crystal of a protein- inhibitor compound complex that 
comprises HDLP or HDLP Cys75Ser/Cys77Ser double mutant and 
trichostatin A (TSA) . In another preferred embodiment of 
the invention the three-dimensional structural information 
is obtained from a co-crystal of a protein-inhibitor 
compound complex that comprises HDLP or HDLP 
Cys75Ser/Cys77Ser double mutant and suberoylanilide 
hydroxamic acid (SAHA) . Any HDLP or HDLP-related protein 
(e.g. HDAC) inhibitor compound that may be co-crystallized 
with HDLP may be used to form a co-crystal of the present 
invention. 

The protein crystals and protein- inhibitory complex co- 
crystals of the present invention diffract to a high 
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resolution limit of at least equal to or greater than 4 
angstrom (A) . In a preferred embodiment, the protein 
crystals and protein-inhibitory complex co-crystals of the 
present invention diffract to a high resolution limit of 
5 greater than 2.5 A. 



A crystal of the present invention may take a variety of 
forms, all of which are contemplated by the present 
invention. In a preferred embodiment, the crystal has a 

10 space group of C2 with one molecule in the asymmetric unit 
and with unit dimensions of a = 51.4 A, b = 93.8 A, c = 
78.7 A and P = 96.9° (see, e.g., Example 2, below). In 
another preferred embodiment, the crystal has a space group 
of P2 1 2 1 2 1 with two molecules in the asymmetric unit and with 

15 unit dimensions of a = 53.4 A, Jb = 94 . 4 A, c = 156.3 A (see, 
e.g., Example 2, below). The HDLP structure comprises a 
parallel 3 sheet with a helices packing against both faces. 
At one end of the 3 sheet, the HDLP has a narrow, tube -like 
pocket formed by several well-ordered loops. The walls of 

20 the pocket are lined with hydrophobic residues and there is 
a zinc binding site and several polar side chains at the 
bottom of the pocket. The inhibitory compounds of the 
present invention bind in the pocket. 

25 The three-dimensional structural information obtained from 
crystals of HDLP, HDLP Cys75Ser/Cys77Ser double mutant, HDLP 
Cys75Ser/Cys77Ser double mutant comprising a zinc atom, HDLP 
comprising an inhibitory compound such as TSA or -SAHA, and 
HDLP Cys75Ser/Cys77Ser dduble mutant comprising an inhibitor 

3 0 compound such as TSA or SAHA may be employed to solve the 
structure of any HDLP-related protein (e.g. HDAC) crystal, 
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or any mutant HDLP-related protein and particularly any wild 
type or mutant of HDLP-related protein complexed with a 
ligand, including a substrate or inhibitor compound. If the 
crystals are in a different space group than the known 
structure, molecular replacement may be employed to solve 
the structure, or if the crystals are in the same space 
group, refinement and difference fourier methods may be 
employed. The structure of HDLP-related proteins (e.g. 
HDAC1) comprise no greater than a 2.0 A root mean square 
deviation (rmsd) in the positions of the Ca atoms for at 
least 50% or more of the amino acids of the full-length HDLP 
structure . 

The present invention also provides a nucleic acid molecule 
encoding an HDLP Cys75Ser/Cys77Ser double mutant having the 
amino acid sequence of SEQ ID NO: 3 and the nucleic acid 
sequence of SEQ ID NO: 4. It is also contemplated by the 
invention that mutations be made in HDLP-related proteins at 
cysteine residues, as with the Cys75Ser/Cys77Ser double 
mutant, in order to facilitate the determination of the 
structure of said proteins bound to a zinc atom. 
Additionally, the present invention provides expression 
vectors which comprise the nucleic acid molecule encoding an 
HDLP Cys75Ser/Cys77Ser double mutant encoded by the sequence 
represented by SEQ ID NO: 4 operatively linked to expression 
control sequences. 

It is another object of the present invention to provide 
methods for the design, identification and screening of 
potential inhibitor compounds of the HDLP/HDAC family. m 
a preferred embodiment the method for the rational design. 
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identification and screening of potential inhibitor 
compounds for HDLP and HDLP-related proteins (e.g. HDACs) 
comprising deacetylase activity comprises the steps of: (a) 
using a three-dimensional structure of an HDLP as defined by 
5 the atomic coordinates of the present invention; (b) 
employing said three-dimensional structure to design or 
select said potential inhibitor compound; (c) synthesizing 
and/or selecting said potential inhibitor; (d) contacting 
said potential inhibitor compound with said enzyme in the 

10 presence of acetylated substrate; and (e) determining the 
percent inhibition of deacetylase activity to determine the 
inhibitory activity of said potential inhibitor compound. 
In a further preferred embodiment, the binding properties of 
• said rationally designed inhibitory compound may be 

15 determined by a method comprising the steps of: (a) forming 
a complex comprising said inhibitory compound and HDLP or a 
HDLP-related protein, (b) co-crystallizing said inhibitory 
compound-HDLP complex; (c) determining said three- 
dimensional structure of said co-crystal through molecular 

20 replacement or refinement and difference fourier with the 
molecular coordinates of HDLP as defined by the present 
invention; and (d) analyzing the three-dimensional structure 
to determine the binding characteristics of said potential 
inhibitor compound. 

25 

It is a further object of the present invention to identify 
a defined class of HDLP/HDAC family inhibitor compounds. 
The HDLP/HDAC family inhibitor compounds of the present 
invention are represented by formula (I) : 



30 
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(I) 




wherein X comprises a cap group which binds to at least one 
amino acid selected from the group consisting of proline and 
leucine; Y comprises an aliphatic chain group which binds to 
10 at least one amino acid selected from the group consisting 
of leucine, phenylalanine and glycine; and Z comprises and 
active site binding group which binds to at least one amino 
acid selected from the group consisting of aspartic acid, 
tyrosine and histidine and may further bind to a zinc atom. 

15 



20 



25 



30 
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Brief Description of the Drawings 

Figure 1 is a table listing the statistics from the X-ray 
crystallographic analysis of a HDLP crystal, a HDLP-TSA co- 
crystal, and a HDLP- SAHA co-crystal. 

5 

Figure 2 shows an alignment of various HDAC homologues with 
percent sequence identity depicted. 

Figure 3 shows a graph indicating the histone deacetylase 
10 activity of HDLP and HDAC1 and the inhibition of HDLP and 
HDAC1 by the inhibitors TSA and HC- toxin. 



i 



Figure 4 shows (A & B) a schematic representation of the 
HDLP-Zn 2+ -TSA complex in two approximately orthogonal views, 
15 (C) a topology diagram of HDLP indicating the regions of 
homology with HDAC 1 , and (D) a close-up schematic 
representation of the HDLP- Zn 2+ - SAHA complex. 



Figure 5 shows (A) a schematic representation of a slice 
20 through a surface representation of HDLP with the pocket 
internal cavities and position of the 3 sheet indicated, (B) 
a schematic representation of a close-up view of the active 
site looking down into the pocket in an orientation similar 
to Figure 4B. 

25 

Figure 6 shows (A) a space-filling representation of TSA in 
the active site pocket, (B) a closeup stereo view of the 
structure of the HDLP-ZN 2 + -TSA complex in a similar 
orientation to Figure 4B, and (C) a schematic representation 
30 of the HDLP-TSA interactions. 



WO 01/18045 PCT/US00/24700 

- 13 - 

Figure 7 shows (A) a schematic representation of the regions 
of homology shared between HDLP and HDAC1 in an orientation 
similar to that of Figure 4A, and (B) a detailed schematic 
representation of the homology shared in the pocket and 
internal cavity between HDLP and HDAC1 in an orientation 
similar to that of Figure 4B. 

Figure 8 shows a schematic representation of the proposed 
catalytic mechanism for the deacetylat ion of acetylated 
lysine . 

Figure 9 shows a schematic representation of a space filling 
diagram showing the conserved amino acids in the active site 
and nearby grooves . 

Figure 10 is the nucleic acid sequence of HDLP from Aquifex 
aeolicus (SEQ ID NO. 2) . 

Figure 11 is the amino acid sequence of full length HDLP 
from Aquifex aeolicus (SEQ ID NO. 1) . 

Figure 12 is the nucleic acid sequence of the HDLP active 
site mutant Tyr2 97Phe (SEQ ID NO. 6) . 

Figure 13 is the amino acid sequence of the HDLP active site 
mutant Tyr2 97Phe (SEQ ID NO. 5) . 

Figure 14 is the nucleic acid sequence of a double mutant of 
HDLP from Aquifex aeolicus comprising a Cys7 5Ser and 
Cys77Ser mutation (SEQ ID NO. 4) . 
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Figure 15 is the amino acid sequence of a double mutant of 

HDLP from Aquifex aeolicus comprising a Cys75Ser and 
Cys77Ser mutation (SEQ ID NO. 3) . 

5 Figure 16-1 to 16-49 lists the atomic structure coordinates 
for HDLP as derived by X-ray diffraction from a crystal of 
HDLP . 

Figure 17-1 to 17-49 lists the atomic structure coordinates 
10 for HDLP Cys75Ser/Cys77Ser double mutant comprising a zinc 
atom in the active site as derived by X-ray diffraction from 
a crystal of the HDLP Cys75Ser/Cys77Ser double mutant. 

Figure 18-1 to 18-99 lists the atomic structure coordinates 
15 for HDLP Cys75Ser/Cys77Ser double mutant as derived by X-ray 
diffraction from a co-crystal of HDLP complexed with TSA. 

Figure 19-1 to 19-48 lists the atomic structure coordinates 
for HDLP Cys75Ser/Cys77Ser double mutant as derived by X-ray 
20 diffraction from a co-crystal of HDLP complexed with SAHA . 



25 



30 
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Detailed Des cription of the Invention 

The present invention provides crystals of a histone 
deacetylase (HDAC) homologue grown in the presence and 
absence of a compound capable of inhibiting the histone 
deacetylase activity of said HDAC homologue. As referred to 
herein, a HDAC homologue (as well as a HDLP- related protein) 
is any protein molecule having (a) greater than 15% sequence 
identity to over the 375 amino acid residues of HDLP; (b) 
having no more than twenty insertions or deletions for a 
total of no more than 100 amino acids; and (c) deacetylase 
activity. Sequence identity is calculated by the program 
DNAstar'" using the identity matrix weighing scheme clustal 
method (DNAstar program, Madison, WI) . 

A HDLP/HDAC inhibitor compound, as used herein, refers to 
any compound represented by Formula (I) : 
(I) 



•(")' 
~ v — 

Y 



wherein X comprises a cap group which binds to at least one 
amino acid selected from the group consisting of tyrosine, 
proline and leucine; Y comprises an aliphatic chain group 
from about 5 to about 10 A, preferably 7 A, which binds to 
at least one amino acid selected from the group consisting 
of phenylalanine and glycine; and Z comprises a active site 
binding group which binds to at least one amino acid 
selected from the group consisting of aspartic acid, 
tyrosine and histidine and which may further bind to a zinc 
atom. The HDAC inhibitory compounds of the present 
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invention can inhibit greater than 50% of the histone 
deacetylase activity of a HDAC homologue or a HDLP-related 
protein . 

5 To grow the crystals of the present invention, the HDAC and 
HDAC- inhibitory compound complex are purified to greater 
than 80% total protein and more preferably purified to 
greater than 90% total protein. For expression and 
purification purposes, the full-length HDLP . (Genbank 
10 accession number AE000719) may be subcloned from Aquifex 
aeolicus chromosomal DNA preparation by the polymerase chain 
reaction (PCR) and inserted into an expression vector. 

A large number of vector-host systems known in the art may 

15 be used. Possible vectors include, but are not limited to, 
plasmids or modified viruses, but the vector system must be 
compatible with the host cell used. Examples of vectors 
include E. coli bacteriophages such as lambda derivatives, 
or plasmids such as pBR322 derivatives or pUC plasmid 

20 derivatives, e.g., pGEX vectors (Amersham- Pharmacia , 
Piscataway, New Jersey), pET vectors (Novagen, Madison, WI), 
pmal-c vectors (Amersham- Pharmacia, Piscataway, New Jersey) , 
pFLAG vectors (Chiang and Roeder, 1993, Pept. Res. 6:62-64), 
baculovirus vectors (Invitrogen, Carlsbad, CA; Pharmingen, 

25 San Diego, CA) , etc. The insertion into a cloning vector 
can, for example, be accomplished by ligating the DNA 
fragment into a cloning vector which has complementary 
cohesive termini, by blunt end ligation if no complementary 
cohesive termini are available or by through nucleotide 

30 linkers using techniques standard in the art.- E.g., 
Ausubel et al . (eds.), Current Protocols in Molecular 
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Biology, (1992) . Recombinant vectors comprising the nucleic 
acid of interest may then be introduced into a host cell 
compatible with the vector (e.g. E. coli, insect cells, 
mammalian cells, etc.) via transformation, transf ection, 
infection, electroporation, etc. The nucleic acid may also 
be placed in a shuttle vector which may be cloned and 
propagated to large quantities in bacteria and then 
introduced into a eukaryotic cell host for expression. The 
vector systems of the present invention may provide 
expression control sequences and may allow for the 
expression of proteins in vitro. 

In a preferred embodiment, the full length HDLP (SEQ ID 
NO:2) is subcloned from Aquifex aeolicvs chromosomal DNA 
preparation into P GEX4T3 (Amersham- Pharmacia , Piscataway, 
New Jersey). m order to construct a double mutant 

comprising a Cys75Ser and Cys77Ser mutation (SEQ ID N0:4), 
and to construct the HDLP active site mutant Tyr297Phe (SEQ 
ID NO:5 and SEQ ID N0:6), PGR site directed mutagenesis may 
be employed with verification by DNA sequencing by methods 
known to those skilled in the art (see, e.g.. Example 1 
below). The mutants of the present invention may be 
subcloned into a suitable expression vector and introduced 
into a host cell for protein production, as described above. 

The HDLP nucleic acids of the present invention may be 
subcloned into an expression vector to create an expression 
construct such that the resultant HDLP molecule which is 
produced comprises a fusion protein wherein said fusion 
protein comprises a tag for ease of purification. As 
referred to herein, a "tag" is any additional amino acids 
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which are provided in a protein either c- terminally , n- 
terminally or internally for the ease of purification, for 
the improvement of production or for any other purpose which 
may facilitate the goals of the present invention (e.g. to 
5 achieve higher levels of production and/or purification) . 
Such tags include tags known to those skilled in the art to 
be useful in purification such as, but not limited to, his 
tag, glutathione-s-transf erase tag, flag tag, mbp (maltose 
binding protein) tag, etc. In a preferred embodiment, the 

10 wild-type and mutant HDLPs of the present invention are 
tagged with glutathione-s-transf erase (see Example 1 below) . 
In another preferred embodiment, HDAC1 is flag tagged (see 
Example 1 below) . Such tagged proteins may also be 
engineered to comprise a cleavage site, such as a thrombin, 

15 enterokinase or factor X cleavage site, for ease of removal 
of the tag before, during or after purification. Vector 
systems which provide a tag and a cleavage site for removal 
of the tag are particularly useful to make the expression 
constructs of the present invention. 

20 

The tagged HDLPs and HDACs of the present invention may be 
purified by immuno-af f inity or conventional chromatography, 
including but not limited to, chromatography employing the 
following : glutathione-sepharose™ (Amersham- Pharmacia, 

25 Piscataway, New Jersey) or an equivalent resin, nickel or 
cobalt -purification resins, anion exchange chromatography, 
cation exchange chromatography, hydrophobic resins, gel 
filtration, antiflag epitope resin, reverse phase 
chromatography, etc. After purification, the HDLP and HDLP- 

3 0 inhibitor compound complex may be concentrated to greater 
than 1 mg/ml for crystallization purposes. In a preferred 
embodiment HDLP and HDLP- inhibitor complexes are 
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concentrated to greater than 10 mg/ml for crystallization 
and in a particularly preferred embodiment, HDLP and HDLP- 
inhibitor complexes are concentrated to greater than 20 
mg/ml . 



In order to determine whether the purified HDLPs of the 
present invention demonstrate histone deacetylase activity, 
the purified HDLPs and also any HDLP- related protein may be 
assayed by any method known to those skilled in the art for 
the determination of said activity. in a preferred 
embodiment, the purified HDLPs of the present invention are 
incubated in the presence of [ 3 H] acetyl -labeled histone 
substrate (Carmen et al . , 1996, J. Biol. chem. 221:15837- 
15844) in a buffer suitable for detection of histone 
deacetylase activity (see Example 3 below); stopping the 
reaction; extracting the released acetate and measuring said 
released acetate, as described by Henzel et al . (J. Biol. 
Chem. 266:21936-21942 (1991); Example 3 below). In a 
preferred embodiment, the HDLPs of the present invention are 
inclubated in the presence of ZnCl 2 in order to obtain 
histone deacetylase activity therefrom (Example 3 below) . 

In another embodiment, the crystals of the present invention 
comprise purified wild-type HDLP (SEQ ID NO:l) and are grown 
at room temperature by the hanging-drop vapor-diffusion 
method from a crystallization solution comprising one or 
more precipitants selected from the group consisting of 
isopropanol, polyethylene glycol, and tert butanol (see 
Example 2 below). The 'crystallization solution may further 
comprise one or more salts including salts selected from the 
group consisting of NaCl and KCl, and one or more buffers 
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including buffers selected from the group consisting of Tris 
(tris (hydroxymethyl) ami nome thane and bis-tris propane-Cl 
(1 , 3-bis [tris (hydroxymethyl) methyl -amino] propane) (see 
Example 2 below) . The pH of the crystallization solution is 
5 preferably between pH 5 to 9, although other pH values are 
also contemplated by the present invention (see Example 2 
below) . 

Any crystallization technique known to those skilled in the 
10 art may be employed to obtain the crystals of the present 
invention, including, but not limited to, batch 
crystallization, vapor diffusion (either by sitting drop or 
hanging drop) and micro dialysis. Seeding of the crystals 
in some instances may be required to obtain X-ray quality 
15 crystals. Standard micro and/or macro seeding of crystals 
may therefore be used. 



The crystals of the present invention may form in the space 
group C2 with one molecule in the asymmetric unit and with 

20 unit dimensions of a = 51.4 A, b = 93.8 A, c = 78.7 A and 
(5 = 96.9° (see Example 2 below). The crystals of the 
present invention may also form in the space group P2 1 2 1 2 1 
with two molecules in the asymmetric unit and with unit 
■ dimensions of a = 53.4 A, b = 94.4 A, c = 156.3 A (see 

25 Example 2 below) . However, the present invention 

contemplates crystals which form in any space group 
including, but not limited to, C2 , P2 lt P2 1 2 1 2 1 , 93^21, 
P4 3 2 1 2 1 , and C222j. The crystals diffract to a resolution 
greater than 4 A, preferably greater than 2.5 A. 

• 30 



To collect diffraction data from the crystals of the present 
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invention, the crystals may be flash-frozen in the 
crystallization buffer employed for the growth of said 
crystals, however with preferably higher precipitant 
concentration (see, e.g., Example 2 below). For example, 
but not by way of limitation, if the precipitant used was 
28% PEG 1500, the crystals may be flash frozen in the same 
crystallization solution employed for said crystal growth 
wherein the concentration of the precipitant is increased to 
35% (see Example 2 below) . if the precipitant is not a 
sufficient cryoprotectant (i.e. a glass is not formed upon 
flash-freezing), cryoprotectants (e.g. glycerol, low 
molecular weight PEGs, alcohols, etc) may be added to the 
solution in order to achieve glass formation upon flash- 
freezing, providing the cryoprotectant is compatible with 
preserving the integrity of the crystals. The flash- frozen 
crystals are maintained at a temperature of less than -110°C 
and preferably less than -150°C during the collection of the 
crystallographic data by X-ray diffraction. The X-ray 
diffraction data may be processed with DENZO and SCALE PACK 
(Otwinowski & Minor, 1997, Method Ensemble. 2_7£:307-326) but 
any method known to those skilled in the art may be used to 
process the X-ray diffraction data. 

In order to determine the atomic structure of HDLP according 
to the present invention, multiple isomorphous replacement 
(MIR) analysis, model building and refinement may be 
performed. For MIR analysis, the crystals may be soaked in 
heavy-atoms to produce heavy atom derivatives necessary for 
MIR analysis. As used herein, heavy atom derivative or 
derivitization refers to the method of producing a 
chemically modified form of a protein or protein complex 



WO 01/18045 PCT/US00/24700 

- 22 - 

crystal wherein said protein is specifically bound to a 
heavy atom within the crystal. In practice a crystal is 
soaked in a solution containing heavy metal atoms or salts, 
or organometallic compounds, e.g., lead chloride, gold 
5 cyanide, thimerosal , lead acetate, uranyl acetate, mercury 
chloride, gold chloride, etc, which can diffuse through the 
crystal and bind specifically to the protein. The 
location (s) of the bound heavy metal atom(s) or salts can be 
determined by X-ray diffraction analysis of the soaked 

10 crystal. This information is used to generate MIR phase 
information which is used to construct the three-dimensional 
structure of the crystallized HDLPs and HDLP-related 
proteins of the present invention. In a preferred 
embodiment, the heavy atoms comprise thimerosal, KAu(CN) 2 and 

15 Pb(Me) 3 OAc (see Example 2 below). The MIR phases may be 
calculated by any program known to those skilled in the art 
and preferably with the program MLPHARE (The CCP4 suite: 
Programs for computational crystallography, 1994, Acta 
Crystallogr . D. 150:760-763) and may also use the anomalous 

20 diffraction signal from the thimerosal derivative. In a 
preferred embodiment, the MIR phases were calculated at 2.5 
A and have a mean figure of merit of 0.55 (see Figure 19 and 
Example 2 below) . The phases may be improved where 
necessary by solvent flattening by methods known to those 

25 skilled in the art including, but not limited to, through 
the use of the program DM (The CCP4 suite: Programs for 
computational crystallography, 1994, Acta Crystallogr . D 
50 : 760-763) . 

30 Thereafter, an initial model of the three-dimensional 
structure may be built using the program O (Jones et al . , 



W ° 01/18045 PCT/US00/24700 

- 23 - 

1991, Acta Crystallogr. A 47 : 110-119) . The interpretation 
and building of the structure may be further facilitated by 
use of the program CNS (Brunger et al . , 1998, Acta 
Crystallogr. D £4:905-921) . 

For the determination of the HDLP- inhibitor compound complex 
structure, if the space group of the HDLP- inhibitor compound 
complex crystal is different, molecular replacement may be 
employed using a known structure of apo-HDLP ( as referred 
to herein, apo-HDLP or apo-HDAC is the enzyme which is not 
complexed with an inhibitor compound) or any known 
HDLP/inhibitor complex structure whose structure may be 
determined as described above and below in Example 2. if 
the space group of the HDLP- inhibitor compound crystals is 
the same, then rigid body refinement and difference fourier 
may be employed to solve the structure using a known 
structure of apo-HDLP ( as referred to herein, apo-HDLP or 
apo-HDAC is the enzyme which is not complexed with an 
inhibitor compound) or any known HDLP/inhibitor complex 
structure . 



The term "molecular replacement" refers to a method that 
involves generating a preliminary model of the three- 
dimensional structure of the HDLP crystals of the present 
invention whose structure coordinates are unknown prior to 
the employment of molecular replacement. Molecular 
replacement is achieved by orienting and positioning a 
molecule whose structure coordinates are known (in this case 
the previously determined apo-HDLP) within the unit cell as 
defined by the X-ray diffraction pattern obtained from an 
HDLP or HDLP-related protein crystal whose structure is 
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unknown so as to best account for the observed diffraction 
pattern of the unknown crystal . Phases can then be 
calculated from this model and combined with the observed 
amplitudes to give an approximate Fourier synthesis of the 
5 structure whose coordinates are unknown. This in turn can 
be subject to any of several forms of refinement to provide 
a final, accurate structure. 

Any method known to the skilled artisan may be employed to 
10 determine the structure by molecular replacement. For 
example., the program AMORE (The CCP4 suite: Programs for 
computational crystallography, 1994, Acta Crystallogr. D. 
150:760-763) may be employed to determine the structure of an 
unknown histone deacetylase +/- an inhibitor by molecular 
15 replacement using the apo-HDLP coordinates (Figure 16) . For 
the structure determination of the inhibitory compound TSA, 
the structure of TSA was obtained from the Cambridge 
Structural Database (Refcode TRCHST , << 

http://www.ccdc.cam.ac.uk >>) may be employed to define the 
20 stereochemical restraints used in the refinement with the 
program CNS (Brunger et al . , 1998, Acta Crystallogr. D 
54 : 905-921) . 

The three-dimensional structural information and the atomic 
25 coordinates associated with said structural information of 
HDLP are useful for solving the structure of crystallized 
proteins which belong to the HDAC family by molecular 
replacement. Similarly, any structure of a crystallized 
protein which is thought to be similar in structure based on 
3 0 function or sequence similarity or identity to HDLP may be 
solved by molecular replacement with the HDLP structural 
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information of the present invention. The structure of 
HDLP-related proteins as determined by molecular replacement 
as described above and in Example 2 below, comprise a root 
mean square deviation (rmsd) of no greater than 2.0 A in the 
positions of Ca atoms for at least 50% or more of the amino 
acids of the structure over the 375 residues of full-length 
HDLP. Such a rmsd may be expected based on the amino acid 
sequence identity. Chothia & Lesk, 1986, Embo J. 5:823-826. 

The refined three-dimensional HDLP structures of the present 
invention, specifically apo-HDLP, Cys75Ser/Cys77Ser double 
mutant HDLP comprising a zinc atom in the active site, 
HDLP/TSA complex comprising a zinc atom in the active site, 
and HDLP /S AHA complex comprising a zinc atom in the active 
site, are represented by the atomic coordinates set forth in 
Figures 16 to 19 respectively. The refined model for apo- 
HDLP comprising amino acids 1-375 consists of wild-type HDLP 
residues 2 to 373 with residues 1, 374 and 375 not modeled 
and presumed disordered and was determined to a resolution 
of 1 - 8 A - Similarly, the refined model for 

Cys75Ser/Cys77Ser double mutant HDLP comprising a zinc atom 
in the active site also consists of residues 2 to 373 with 
residues 1, 374 and 375 not modeled and presumed disordered 
and was determined to a resolution of 2.0 A. The refined 
model for the HDLP/TSA complex comprising a zinc atom in the 
active site consists of the Cys75Ser/Cys77Ser double mutant 
HDLP residues 2 to 373 with residues 1, 374 and 375 not 
modeled and presumed disordered, has TSA in the binding 
pocket and was determined to a resolution of 2.1 A. The 
HDLP/SAHA complex is similar to the HDLP/TSA complex but has 
SAHA in the binding pocket and was determined to a 
resolution of 2.5 A. 



WO 01/18045 PCT/US00/24700 

- 26 - 

For the purposes of further describing the structure of HDLP 
and HDLP-related proteins, including, but not limited to, 
HDACs , from the data obtained from the HDLP crystals of the 
present invention, the definition of the following terms is 
provided : 

The term "3 sheet" refers to two or more polypeptide chains 
(or p strands) that run alongside each other and are linked 
in a regular manner by hydrogen bonds between the main chain 
C=0 and N-H groups. Therefore all hydrogen bonds in a 
beta- sheet are between different segments of polypeptide. 
Most p-sheets in proteins are all-parallel (protein 
interiors) or all-antiparallel (one side facing solvent, the 
other facing the hydrophobic core) . Hydrogen bonds in 
antiparallel sheets are perpendicular to the chain direction 
and spaced evenly as pairs between strands. Hydrogen bonds 
in parallel sheets are slanted with respect to the chain 
direction and spaced evenly between strands. 

The term "a helix" refers to the most abundant helical 
conformation found in globular proteins. The average length 
of an a helix is 10 residues. In an a helix, all amide 
protons point toward the N- terminus and all carbonyl oxygens 
point toward the C- terminus .. The repeating nature of the 
phi, psi pairs ensure this orientation. Hydrogen bonds 
within an a helix also display a repeating pattern in which 
the backbone C=0 of residue X (wherein X refers to any amino 
acid) hydrogen bonds to the backbone HN of residue X+4 . 
The a helix is a coiled structure characterized by 3.6 
residues per turn, and translating along its axis 1.5 A per 
amino acid. Thus the pitch is 3.6x1.5 or 5.4 A. The screw 
sense of alpha helices is always right-handed. 
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The term "loop" refers to any other conformation of amino 
acids (i.e. not a helix, strand or sheet). Additionally, a 
loop may contain bond interactions between amino acid side 
chains, but not in a repetitive, regular fashion. 

Amino acid residues in peptides shall herein after be 
abbreviated as follows: Phenylalanine is Phe or F; Leucine 
is Leu or L; Isoleucine is He or I; Methionine is Met or M; 
Valine is Val or V; Serine is Ser or S; Proline is Pro or P; 
Threonine is Thr or T; Alanine is Ala or A; Tyrosine is Tyr 
or Y; Histidine is His or H; Glutamine is Gin or Q ; 
Asparagine is Asn or N; Lysine is Lys or K; Aspartic Acid is 
Asp or D; Glutamic Acid is Glu or E; Cysteine is Cys or C; 
Tryptophan is Trp or W; Arginine is Arg or R ; and Glycine is 
Gly or G. For further description of amino acids, please 
refer to Proteins: Structure and Molecular Properties by 
Creighton, T.E., W.H. Freeman & Co. , New York 1983. 

The term "positively charged amino acid" refers to any amino 
acid having a positively charged side chain under normal 
physiological conditions. Examples of positively charged 
amino acids are Arg, Lys and His. The term "negatively 
charged amino acid" refers to any amino acid having a 
negatively charged side chain under normal physiological 
conditions. Examples of negatively charged amino acids are 
Asp and Glu. The term "hydrophobic amino acid" refers to 
any amino acid having an uncharged, nonpolar side chain that 
is relatively insoluble in water. Examples of hydrophobic 
amino acids are Ala, Leu,- He, Gly, Val, Pro, Phe, Trp and 
Met. The term "hydrophilic amino acid" refers to any amino 
acid having an uncharged, polar side chain that is 
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relatively soluble in water. Examples of hydrophilic amino 
acids are Ser, Thr, Tyr, Asp, Gin, and Cys . The term 
"aromatic amino acid" refers to any amino acid comprising a 
ring structure. Examples of aromatic amino acids are His, 
5 Phe, Trp and Tyr. 

The term "charge relay system" refers to a His-Asp 
arrangement as described by Fersht & Sperling, 1973 , «J. Mol . 
Biol. 74:137-149; Blow et al . , 1969, Nature 221 :337-340 . 

10 

The information obtained from the three-dimensional 
structures of the present invention reveal that HDLP has a 
single-domain structure that belongs to the open a/3 class 
of folds (see, e.g., Branden, 1980, Q. Rev. Biophys . 

15 13:317-38) . Two orthogonal views of the overall three- 
dimensional structure of HDLP are depicted in Figure 4A and 
4B. The HDLP structure has a central eight - stranded 
parallel 3 sheet (strands arranged as 32-31-33-38-37-34-35- 
36) , and sixteen a helices (labeled al through al6 

20 respectively) . See Figure 4C. Four of the helices pack on 
either face of the 3 sheet (a7, a8, a9, alO and all, al2, 
al3, al4) forming the core a/3 structure characteristic of 
this class of folds. Most of the remaining eight helices 
are positioned near one side of the 3 sheet, near stands 32- 

25 31-33-38. Large, well defined loops (Loops L1-L7; Figure 
4C) originate from the C-terminal ends of the 3~strands. 
The extra helices and the large L1-L7 loops are associated 
with a significant extension of the structure beyond the 
core a/3 motif. This extension of the structure gives rise 

30 to two prominent architectural features: a deep,, narrow 
pocket and an internal cavity adjacent to the pocket. These 
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two architectural features comprise the active site (see 
Figure 5A) . The structure of HDLP-related proteins (e.g. 
HDACs) may also comprise the conserved a/3 structure 
characteristic . 

The term "active site" comprises any or all of the following 
sites in HDLP, the substrate binding site, the site where 
the cleavage of an acetyl group from a substrate occurs or 
the site where an inhibitor of the HDAC family or, more 
particularly, HDLP binds. The active site, as referred to 
herein, comprises Aspl66, Asp258, Hisl70, Tyr297, Hisl31, 
Hisl32, Aspl68, Aspl73, Phel41, Phel98, Leu265, Pro22 and 
Glyl4 0, and also a metal bound at the bottom of the pocket 
by Aspl73, Aspl68 and His defined by the coordinates listed 
in Figures 16 to 19 with an rmsd of 2.0 A. The metal which 
binds at the bottom of the pocket will be a divalent cation 
selected from the group consisting of zinc, cobalt or 
manganese . 

The deep narrow pocket has a tube -like shape with a depth of 
- 11 A. The pocket opening constricts half way down to - 
4.5 by 5.5 A, and becomes wider at the bottom (see Figure 
5A) . The pocket and its immediate surroundings are made up 
of loops LI through L7 . 

The walls of the pocket are covered with side chains of 
hydrophobic and aromatic residues (Pro22, Tyr91 near the 
entrance; and Glyl40, Phel41, Phe 198, Leu2 65 and Tyr297 
further down; Figure 5B) . For numbering of amino acids 
please refer to SEQ ID N0:1. of particular interest are 
Phel41 and Phel98, whose phenyl groups face each other in 
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parallel at a distance of 7.5 A, marking the most slender 
portion of the pocket (see Figure 5B) . Of particular 
interest is that only one pocket residue differs in HDAC1 
when the sequences are aligned (alignment may be 
5 accomplished using DNAstar™ MegAlign™ program, Madison, WI) , 
this residue is Glu98 of HDAC1 which is Tyr91 in HDLP. The 
structure reveals that this residue in HDLP is mostly 
solvent exposed. 

10 Near the bottom of the pocket of the active site at its 
narrowest point, is located a zinc ion (see Figure 6A) . In 
order to obtain the zinc in the structure, the crystals may 
be soaked in zinc (e.g. ZnCl 2 ) or co-crystalized in the 
presence of zinc. The zinc ion is coordinated by Aspl68 

15 (051, 2.1 A), Hisl70 (NSl, 2.1 A), Asp258 (051, 1.9 A) and 

a water molecule (2.5 A). See Figure 5B and 6B. The amino 
acid residues that coordinate zinc are arranged in a 
tetrahedral geometry, but the position of the water 
molecule, which is also hydrogen bonded to Hisl31, deviates 

20 from this geometry by -25°. 



In addition to the zinc ligands, the bottom of the pocket 
contains two histidine (Hisl31 and Hisl32) , two aspartic 
acids (Aspl66 and Aspl73) and a tyrosine (Tyr297). See 

25 Figure 5B and 10B. Each of the histidines makes a hydrogen 
bond through its N51 to an aspartic acid carboxylate oxygen, 
with the oxygen located in the plane of the imidizole ring 
(Figure 5B) . This His-Asp arrangement is characteristic of 
the charge relay system present in the active sites of 

30 serine proteases, where it serves to polarize the imidizole 
Ne and increase its basicity. Fersht & Sperling, 1973, J. 
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Mol. Biol. 74:137-149; Blow et al . , 1969, Nature 221:337- 
340 . 



The Aspl66-Hisl31 charge pair relay (hereafter referred to 
as "buried charged relay") is positioned even deeper in the 
pocket and more buried compared to the Aspl73 -Hisl32 charge 
relay (hereafter referred to as "exposed charge relay") 
which is partially solvent exposed. The buried charge relay 
makes a hydrogen bond (2.6 A) to the zinc-bound water 
molecule referred to above, and this hydrogen bond could 
contribute to the deviation of the water- zinc coordination 
from ideal geometry (Figure 5B) . The exposed charge relay 
is directed to a point - 2.5 A away from the water molecule 
and closer to the surface. 

Tyr 297 is positioned next to the zinc, opposite from where 
the two charge relay systems are located. The Tyr hydroxyl 
group lies 4.4 A away from the zinc atom and has no 
interactions with the rest of the protein (Figure 5B) . Next 
to Tyr297, there is an opening in the pocket wall, which 
leads to the adjacent internal cavity. 

The floor of the internal cavity is made up of portions of 
the L3 and L7 loops as they emerge from the (3 strands, and 
the roof is made up by the al-Ll-a2 segment. The LI loop 
appears more flexible than other loops in the structure. 
This may allow the transient exchange of the cavity contents 
with the bulk solvent. 

The cavity is lined primarily with hydrophobic residues and 
is particularly rich in glycine residues (Alal27, Glyl28, 
Glyl29, Metl30, and Phel41 of L3 ; Gly293, Gly294, Gly295 and 
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Gly296 of L7; and Tyrl7, Pro22 and Leu23 of LI) . There are 
only two charged residues in the cavity (Arg27 and His 21) 
and these are contributed by the LI loop. 

5 The cavity may provide space for the diffusion of the 
acetate product away from the catalytic center, which may 
otherwise be crowded and shielded during deacetylat ion from 
the solvent when the substrate is bound. Such a role for 
the cavity is supported by the observation that the cavity 

10 contains three water and two isopropanol molecules (from the 
crystallization buffer) in the 1.8 A apo-protein structure. 
The cavity may also bind another cof actor, in addition to 
zinc, for the facilitation of the enzymatic activity of the 
HDLP. A proposed catalytic mechanism for deacetylat ion is 

15 provided in Figure 8. 

The structure of HDLP as defined by the present invention, 
in conjunction with the HDAC1 sequence homology, shows that 
the 375-amino acid HDLP protein corresponds to the histone 

20 deacetylase catalytic core which is conserved across the 
HDAC family (see Figure 2). The 35.2% HDLP-HDAC1 sequence 
identity predicts structural similarity with a rmsd in Ca 
positions of ~ 1.5 A. Chothia and Lesk describe the 
relation between the divergence of sequence and the 

25 structure of proteins in Embo J. 5:823-826 (1986) . The 40- 
residue C-terminus of HDLP is likely to have a divergent 
structure since this region has lower homology to HDAC1 , 
although the al6 helix in this region is part of the 
conserved open a/3 core fold and HDAC1 is likely to comprise 

30 a similar helix. However divergent this C- terminal region 
may be, this region is outside the active site and is likely 
to not effect the structure of the active site. Beyond the 
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C-terminus of the histone deacetylase catalytic core, HDAC 
family members are divergent in length and sequence. in the 
HDAC family, this region (amino acid residues -390-482) i s 
highly polar, populated with acidic residues, and is likely 
to be flexible or loosely folded. 

The HDLP -HDAC homology maps primarily to the hydrophobic 
core and to the L1-L7 loops, with portions of the loops that 
make up the pocket and adjacent cavity having the highest 
level of amino acid residue sequence conservation (Figure 9A 
and 9B) . Specifically, all of the polar residues in the 
active site (the zinc ligands, the two charge relay systems, 
and Tyr297) and the hydrophobic residues that make up the 
walls of the pocket (Glyl40, Phel41, Phel98 and Leu265) are 
identical. Among the residues that make up the internal 
cavity, the ones closest to the active site are either 
identical or conservatively substituted (for example, Leu23 
-> Met and Metl30-> Leu) . Surface residues around the pocket 
are conserved to a lesser extent, but are still above 35% 
average sequence identity. 

The information obtained from the inhibitor-bound HDLP 
complex crystal structures of the present invention reveal 
detailed information which is useful in the design, 
isolation, screening and determination of potential 
inhibitor compounds which may inhibit HDLP/HDAC family 
members. As described above, the HDLP structure consists of 
a parallel 3 sheet with a helices packing against both faces 
(Figure 4A, 4B, and 4C) . At one end of the (S sheet, 7 loops 
(L1-L7) form a narrow, tube-like pocket which are lined with 
hydrophobic residues and which comprise a zinc binding site, 
several polar side chains, including two Asp-His charge 
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relay systems. Mutation of the zinc ligands and other polar 
residues at the pocket bottom reduces or eliminates the 
catalytic activity. 

5 The present inventors found that mutation at the Tyr297Phe 
site reduced activity. See also, Hassig et al . , 1998, Proc . 
Natl. Acad. Sci . USA 95:3519-3524; Kadosh & Struhl, 1998, 
Genes Dev. 12.: 797-805. The elimination of activity by 
mutation of these residues indicates that this region is the 
10 enzyme active site. Adjacent to the active site, there is 
an internal cavity that may provide space for the diffusion 
of the acetate reaction product. Homology at the active 
site between HDLP and HDAC1, as described above, indicates 
that they share structural and functional homology. 

15 

The inhibitor compound, trichostatin A (TSA) (Tsuji et al . , 
1976, J. Antibiotics 2^:1-6) binds HDLP by inserting its 
long aliphatic chain, which has a hydroxamic acid group at 
one end, into the pocket (Figure 6A, 6B and 6C) . The 

20 aliphatic chain makes multiple contacts in the well-like, 
hydrophobic portion of the pocket. The hydroxamic acid 
reaches the polar bottom of the pocket, where it coordinates 
the zinc in a bidentate fashion and also forms hydrogen 
bonds with the polar residues in the active site, including 

25 the two charge relay system histidines. The aromatic 
dimethylamino-phenyl group at the other end of the TSA chain 
makes contacts at the pocket entrance and serves to cap it. 
The amino acid residues of HDLP which contact TSA are 
conserved in HDAC, indicating that TSA binds and inhibits 

30 HDAC in a similar fashion to HDLP. 
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In the complex, the hydroxamic acid, most of the aliphatic 
chain and part of the dimethylamino-phenyl group of TSA are 
buried (60% of TSA's surface area; Figure 6A) . The 
hydroxamic acid group binds the zinc in a bidentite fashion 
forming bonds through its carbonyl (2.4 A) and hydroxyl 
groups (2.2 A) resulting in a pent a -coordinated Zn 2+ (Figure 
6B and 6C) . The hydroxamic acid hydroxyl group replaces the 
water molecule that binds to the zinc in the apo-HDLP 
structure described above. The hydroxamic acid also 
hydrogen bonds with both charge relay system histidines 
(hydroxyl oxygen to Hisl31 Ne2, 2.8 A; and nitrogen to 
Hisl32 Ne2, 2.8 A), and the Tyr297 hydroxyl group (2.4 A; 
Figure 6B and 6C) . 

The 5-carbon long branched alkene chain of TSA fits snugly 
in the narrow portion of the pocket making multiple van der 
Waals contacts with all of the hydrophobic groups lining the 
pocket (Figure 6B and 6C) . Near its center, the chain 
contains a methyl substituted carbon-carbon double bond 
which is sandwiched between the phenyl groups of the Phel41 
and Phe98 at the tightest point of the pocket (Figure 6A and 
6B) . The length of the alkene chain appears optimal for 
spanning the length of the pocket, and allowing contacts 
both at the bottom and at the entrance of the pocket, 
although, the cap group of Formula (I) may provide length to 
span the pocket allowing for a shorter alkene chain 
(aliphatic chain) . 

At the entrance of the pocket, one face of the planar 
structure formed by the dimethylamino-phenyl and adjacent 
carbonyl groups of TSA makes contacts at the rim of the 
pocket (Pro22, Tyr91, Phel41; Figure 6B and 6C) . This 
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packing is facilitated by the roughly 110° angle in the 
overall structure of TSA at the junction of the aliphatic 
chain and the dimethylamino-phenyl group (occurring at the 
sp 3 hybridized C8 carbon) . Upon TSA binding, the side chain 
of Tyr91, which is mostly solvent exposed, changes 
conformation to make space for the dimethylamino-phenyl 
group. This is the only change near the active site 
observed upon TSA binding. 

The hydroxamic acid group is a common motif in zinc 
metalloprotease inhibitors. See U.S. Patent No. 5,919,940 
and 5,917,090; See also, Grams et al . , 1995, Biochemistry 
14:14012-14020; Lovejoy et al . , 1999, Nat. Struct. Biol. 
6:217-221; and Holmes & Matthews, 1981, Biochemistry 
2JD: 6912-6920 . Like TSA, these inhibitors also coordinate 
the active site zinc in a bidentate fashion using their 
hydroxamate hyroxyl and carbonyl oxygens, replace the 
nucleophilic water molecule with their hydroxamate hydroxyl 
groups and form hydrogen bonds to the general base (Grams et 
al . , 1995, Biochemistry 34. : 14012 - 14020 ; Lovejoy et al . , 
1999, Nat. Struct. Biol. 6:217-221; and Holmes & Matthews, 
1981, Biochemistry 2£: 6912-6920) . 

SAHA, which has a -30- fold weaker inhibitory activity than 
TSA (Richon et al . , 1998, Proc. Natl.. Acad. Sci . USA 
95:3003-3007), binds HDLP similarly to TSA (see, e.g., 
Figure 4D) . The SAHA hydroxamic acid group makes the same 
contacts to the zinc and active site residues, and the 
importance of these, interactions is underscored by the loss 
of activity of SAHA derivatives lacking the hydroxamic group 
(Richon et al . , 1998, Proc. Natl. Acad. Sci. USA 9^:3003- 
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3007) . The six-carbon long aliphatic chain of SAHA packs in 
the tube-like hydrophobic portion of the pocket. Compared 
to TSA however, SAHA's aliphatic chain packs less snugly and 
makes fewer van der waals contacts, in part, because SAHA 
lacks TSA's C15 methyl group branch. SAHA also lacks TSA's 
double bonds in this region, and this may lead to increased 
flexibility of the aliphatic chain. The cap group of SAHA 
consists of a phenyl -amino ketone group. In the crystal 
structure, the phenyl group has weak electron density, 
suggesting that it does not pack as well as the cap group of 
TSA. This may be due to the larger separation between the 
hydroxamic and cap groups of SAHA compared to TSA (compare 
TSA, Formula (II) and SAHA, Formula (III), below). 



(ID 




(III) 
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The determination of the structure of HDLP and HDLP bound to 
an inhibitory compound has enabled, for the first time, the 
identification of the active site of HDLP and of related 
HDLP proteins, such as proteins belonging to the HDAC 
5 family. 

The three-dimensional structural information and the atomic 
coordinates associated with said structural information of 
HDLP bound to an inhibitory compound is useful in rational 

1.0 drug design providing for a method of identifying inhibitory 
compounds which bind to. and inhibit the enzymatic activity 
of HDLP, HDAC family proteins and other histone deacetylase- 
like proteins related to HDLP. Said method for identifying 
said potential inhibitor for an enzyme comprising 

15 deacetylase activity comprises the steps of (a) using a 
three-dimensional structure of HDLP as defined by its atomic 
coordinates listed in Figure 16 to 19; (b) employing said 
three-dimensional structure to design or select said 
potential inhibitor; (c) synthesizing said potential 

20 inhibitor; (d) contacting said potential inhibitor with said 
enzyme in the presence of an acetylated substrate; and (e) 
determining the ability of said inhibitor to inhibit said 
deacetylase activity . 

25 The potential HDLP and HDLP-related (e.g. HDAC) inhibitors 
identified by the method of the present invention are 
represented by formula (I) 



30 
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(I) 



Y 

wherein X comprises a cap group which binds to at least one 
amino acid selected from the group consisting of proline and 
leucine; Y comprises an aliphatic chain group which binds to 
at least one amino acid selected from the group consisting 
of leucine, phenylalanine and glycine; and Z comprises an 
active site binding group which binds to at least one amino 
acid selected from the group consisting of aspartic acid, 
tyrosine and histidine and wherein Z may further bind to a 
zinc atom and with the provision that the compound of 
Formula (I) is not TSA, trapoxin, SAHA, SAHA derivatives 
described in U.S. Patent Nos . 5,608,108; 5,700,811; 
5,773,474; 5840,960 and 5,668,179. 



The present invention permits the use of molecular design 
techniques to design, identify and synthesize chemical 
entities and compounds, including inhibitory compounds, 
capable of binding to the active site of HDLP and HDLP- 
related proteins. The atomic coordinates of apo-HDLP and 
inhibitor-bound HDLP may be used in conjunction with 
computer modeling using a docking program such as GRAM, 
DOCK, HOOK or AUTODOCK (Dunbrack et al . , 1997, Folding & 
Design 2:27-42) to identify potential inhibitors of HDLP and 
HDLP-related proteins (e.g. HDAC1) . This procedure can 
include computer fitting of potential inhibitors to the 
active site of HDLP to ascertain how well the shape and the 



WO 01/18045 PCT7US00/24700 

- 40 - 

chemical structure of the potential inhibitor will 
complement the active site or to compare the potential 
inhibitors with the binding of TSA or SAHA in the active 
site. See Bugg et al, 1998, Scientific American 
5 December : 92 -98; West et al . , 1995, TIPS 16:67-74. The 
potential inhibitors designed by modeling with a docking 
program conform to the general formula (I) as described 
above. Computer programs may also be employed to estimate 
the attraction, repulsion and stearic hindrance of the HDLP 

10 and potential inhibitor compound. Generally, the tighter 
the fit, the lower the stearic hindrances, the greater the 
attractive forces, and the greater the specificity which are 
important features for a specific inhibitory compound which 
is more likely to interact with HDLP and HDLP-related 

15 proteins rather than other classes of proteins. These 
features are desired particularly where the inhibitory 
compound is a potential antitumor drug. 

The compounds of the present invention may also be designed 
20 by visually inspecting the three-dimensional structure to 
determine more effective deacetylase inhibitors. This type 
of modeling may be referred to as "manual" drug design. 
Manual drug design may employ visual inspection and analysis 
using a graphics visualization program such as "O" (Jones, 

2 5 T.A. , Zhou, J.Y., Cowan, S.W., and Kjeldgaard, M., Improved 

method for building protein models in electron density maps 
and the location of errors in these models, Acta 
Crystallog., A47, 110-119. 

3 0 Initially potential inhibitor compounds can be selected for 

their structural similarity to the X, Y and Z constituents 
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of formula (I) by manual drug design. The structural analog 
thus designed can then be modified by computer modeling 
programs to better define the most likely effective 
candidates. Reduction of the number of potential candidates 
is useful as it may not be possible to synthesize and screen 
a countless number of variations compounds that may have 
some similarity to known inhibitory molecules. Such 
analysis has been shown effective in the development of HIV 
protease inhibitors (Lam et al . , 1994, Science 262: 380-384; 
Wlodawer et al . , 1993, Ann. Rev. Biochem. 62:543-585; 
Appelt, 1993 Perspectives in Drug Discovery and Design 1:23- 
48; Erickson, 1993, Perspectives in Drug Discovery and 
Design 1:109-128. Alternatively, random screening of an 
small molecule library could lead to potential inhibitors 
whose inhibitory activity may then be analyzed by computer 
modeling as described above to better determine their 
effectiveness as inhibitors. 

The compounds designed using the information of the present 
invention may be competitive or noncompetitive inhibitors. 
These designed inhibitors may bind to all or a portion of 
the active site of HDLP and may be more potent, more 
specific, less toxic and more effective than known 
inhibitors for HDLP and HDLP-related proteins, and 
particularly HDACs . The designed inhibitors may also be 
less potent but have a longer half life in vivo and/or in 
vitro and therefore be more effective at inhibiting histone 
deacetylase activity in vivo and/or in vivo for prolonged 
periods of time. Said designed inhibitors are useful to 
inhibit the histone deacetylase activity of HDLP and HDLP- 
related proteins (e.g. HDAC1) , to inhibit cell growth in 
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vitro and in vivo and may be particularly useful as 
antitumor agents. 

The present invention also permits the use of molecular 
5 design techniques to computationally screen small molecule 
data bases for chemical entities or compounds that can bind 
to HDLP in a manner analogous to the TSA and SAHA as defined 
by the structure of the present invention. Such 
computational screening may identify various groups which 

10 may be defined as M X" , "Y" or "Z" of formula (I) above and 
may be employed to synthesize the potential inhibitors of 
the present invention comprising formula (I) . Such 
potential inhibitors may be assayed for histone deacetylase. 
inhibitory activity in a histone deacetylase activity assay 

15 (see Example 3 below) , may be co-crystallized with HDLP to 
determine the binding characteristics through X-ray 
crystallography techniques defined above (e.gr. said co- 
crystal structure may be determined by molecular replacement 
to assess the binding characteristics of said potential 

20 inhibitor) , or may be assessed based on binding activity by 
incubating said potential inhibitor with said HDLP, 
performing gel filtration to separate any free potential 
inhibitor to HDLP-bound inhibitor, and determining the 
amcpunt of histone deacetylase activity of the inhibitor- 

25 bound HDLP. To measure binding constants (e.g., Kd) , 
methods known to those in the art may be employed such as 
Biacore™ analysis, isothermal titration calorimetry, Elisa 
with a known drug on the plate to show competitive binding, 
or by a deacetylase activity assay. 

30 
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The design of potential inhibitors of the present invention 
is further facilitated by reference to Figure 9, which is a 
surface representation figure that depicts the surface 
grooves. Analysis of such grooves gives insight into the 
constituents of the cap group of formula (I). The surface 
grooves are labeled groove A, groove A', groove B and groove 
C, into which additional cap groups may bind. The structure 
of HDLP bound to either TSA or SAHA shows that the cap 
groups of TSA and SAHA bind in groove A. By analysis of the 
amino acid sequence identity of HDLP and HDACs , Groove A is 
well conserved in HDACs , has a significant hydrophobic 
component, appears deep enough to allow for significant 
interactions and is also the largest of the four grooves. 
In addition to the dimethylamino phenyl group of the TSA, 
the A groove can fit approximately 200 daltons worth of 
groups (e.g. groove A could accommodate a naphthalene-like 
group after an appropriate spacer, etc.). Groove A, as 
referred to herein, is characterized by the following 
conserved residues of HDLP: His 21, Pro22, Lys24, Phel41, 
Leu265 and Phe335. The periphery of groove A comprises 
unconserved residues. Additionally, Groove A', as referred 
to herein, comprises primarily unconserved residues. 

Groove B is immediately adjacent to the pocket. of 
significance is that the bottom of groove B comprises the N- 
epsilon nitrogen of Hisl70, which coordinates the zinc 
through its N-delta nitrogen. Significant binding energy 
may be achieved by contacting the Ne proton of Hisl70 with 
a carboxylic acid or sulfate group. In addition, groove B 
may be large enough to fit a phenyl group, the face of which 
may comprise a partial negative charge which may pack over 
the N-epsilon proton of Hisl70. The conserved residues of 
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groove B, as referred to herein are: Hisl70, Tyrl96 and 
Leu265. 



Groove C is not as well conserved as the other two grooves 
and the amino acid residues which comprise groove C are 
mostly polar and solvent exposed. Groove C, as referred to 
herein comprises the following conserved residues: Asn87, 
Glyl40 and Phel98 . 

The compounds of the present invention are represented by 
formula (I) : 
(I) 




Examples for suitable X constituents wherein X comprises a 
cap group may be described in three categories, depending 
upon which surface of groove A, A', B and/or C they are 
targeted to. The cap group may comprise all three 
categories on the same compound. Of particular benefit may 
be replacing the cap group of TSA or SAHA with a large, 
rigid structure. Nonlimiting examples for suitable cap 
groups (X) of formula (I) which may bind in groove A are: 
(1) attaching a 1-3 methyl linker followed by a phenyl or 
naphthalene group from the para or meta position of SAHA's 
phenyl group represented by formula (IV) : 
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(IV) 



n<H 2 C) <^ ^ ^ 



0 



(2) attaching a 2-3 methyl linker followed by a phenyl or 
naphthalene group from the meta position of TSA's phenyl cap 
group, or from TSA's dimethyl amino group represented by 
formula (V) : 
(V) 



phenyl or 
napthalene 



phenyl or 
napthalene 




and which may bind in groove 
followed by a carboxylate, 
represented by formula (VI): 
(VI) 



is a 1-3 methyl group spacer 
sulfate or phenyl group as 



-COOH 
•SO, 



(CH 2 )n R t R J 




\ / 



With respect to the aliphatic (Y) group, the diameter of the 
pocket suggests that one more methyl "side chain" could fit, 
in addition to the CIS methyl group on the CIO carbon. 
Nonlimiting suitable examples for Y constituents wherein Y 
comprises an aliphatic chain group are as follows: (1) add 



15 
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a methyl group to TSA on the C12 carbon (with or without a 
methyl group on the CIO carbon and with or without double 
bonds and with or without substituting the X and/or Z 
constituents of formula (Das represented by formula (VII): 
(VII) 



(is) 

I 

(branch here) 



0. 



10 (2) add a methyl group to TSA on the C9 carbon (with or 
without a methyl group on the CIO carbon; with or without 
both or either of the double bonds, and with or without 
substituting the X and/or Z constituents of formula (I) as 
represented by formula (VIII) : 



(VIII) 



i« ( 15 ) , (Branch here) 

I 



(3) replace the two alkalene double bonds of TSA with only 
one between CIO and Cll, which may free the Cll and C12 
torsion to allow for a better fit, the X and/or Z groups may 
also be substituted as represented by formula (IX) : 
25 (IX) 




(,5) 
I 



30 
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(4) cyclize CIS and C12 carbons of TSA through a sulphur 
atom (or nitrogen atom) , the X and/or Z groups may also be 
substituted as represented by formula (X) : 

(X) 




on 



(5) extend from the C9 carbon of TSA such that the extensi 
approaches and/or enters groove B (see Figure 9) ; making C9 
s P 3 so that it can have some freedom; attach to C9 a 1-3 
methyl group spacer which may include a double bond and they 
attaching thereto a sulfate, carboxylate, sulfate, hyroxyl , 
or phenyl group which may make an interaction with the N- 
epsilon proton of Hisl70 which may coordinate the zinc atom 
as represented by formula (XI) : 
(XI) 



(is) 



i 

(CH 2 )n 



(6) extend off the C8 carbon (replacing C14) of TSA such 
that the extension approaches or enters groove B; attach a 
1-3 methyl group spacer (which may include a double bond) 
and then link thereto a carboxylate, sulfate, hydroxyl or 
phenyl group such that an interaction is made with the N- 
epsilon proton of Hisl70 that coordinates the zinc atom; the 
X and/or Z constituents may also be substituted as 
represented by formula (XII) : 
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<9 H ?>n (is) 



-COOH 

-so 4 

-OH 




(7) substitute the C8 carbon at the end of the aliphatic 
chain such that the substitution may contact groove A, A', B 
10 and or C, in such an example, a cap group (X) may or may not 
be required and the X and Z constituents may be substituted 
as well, as represented by formula (XIII): 
(XIII) 



15 



20 



25 




(8) formulas VII through XIII above wherein the aliphatic 
chain further comprises a methyl group between the active 
site binding group (Z) and the C8 carbon, and preferably 
just before the C8 carbon, increasing the distance between 
X and Z, (9) make the connection between the aliphatic chain 
and the cap group more rigid (e.g., by closing a 6-membered 
ring which may or may not comprise oxygen, the X and Z group 
may also be substituted as represented by formula (XIV) : 
(XIV) 



30 
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and (10) combining two or more of the changes depicted by 
formulas (VII -XIV) . 

Additionally, nonlimiting examples for suitable Z groups 
wherein Z comprises an active site binding group are as 
follows: (1) hydroxamic acid, (2) carboxylic acid, (3) 
sulfonamide, (4) acetamide, (5) epoxyketone, (6) an ester 
with a methyl linker and a hydroxyl of acetate ester group 
to lead into the cavity and interact with a conserved 
arginine (Arg27) as represented by formula (XV) : 
(XV) 



CH,)* II 



/ 
\ 

O CH 3 



and (7) an alphaketone as represented by formula (XVI) 
(XVI) 




/ "h R,=« 
Ri 



-(CH 2 )n OH 

// 

-<CH 2 )n 

O CH 3 



Additionally, other suitable X, Y and Z constituents may be 
envisioned by the skilled artisan given the three- 
dimensional structural information of the present invention. 

After having determined potential suitable X, Y and Z 
constituents, the constituents are combined to form a 
compound of formula (I) us i ng combinatorial chemistry 
techniques. This may be achieved according to U.S. Patent 
Nos. 5,608,108; 5,700,811; 5,773,474; 5,840,960 and 
5,668,179, incorporated herein by reference. Any methods 
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known to one of skill in the art may be employed to 
synthesize compounds of formula (I) comprising X, Y and Z 
constituents as determined by the methods described above. 

5 As mentioned above, the compounds of formula (I) are useful 
to inhibit the histone deacetylase activity of HDLP and 
HDAO related proteins. Such inhibition may allow for a 
reduction or cessation of cell growth in vitro and in vivo. 

10 For in vitro use, such reduction or cessation of cell growth 
is useful to study the role of histone deacetylation and 
differentiation during the cell cycle and also to study 
other mechanisms associated with cell cycle arrest and 
particularly how the repression of transcription is involved 

15 in cell cycle progression which may be studies in a yeast 
model system such as that described by Kadosh & Struhl, 
1998, Mol. Cell. Biol. 18:5121-5127. In vitro model systems 
which may be employed to study the effects of potential 
inhibitors on cell cycle progression and also tumor growth 

20 include those described by: Richon et al , 1998, Proc . Natl. 

Acad. Sci. USA 95 :3003-3007; Yoshida et al. f 1995, Bioessays 
17:423-430; Kim et al . , 1999 , Oncogene 18 : 24 61 -24 70 ; Richon 
et al., 1996, Proc. Natl. Acad. Sci. USA 93:5705-5708; and 
Yoshida et al . , 1987, Cancer J^es. 47.: 3.688-3691 . 

25 

For in vivo use, such a reduction or cessation of cell 
growth is useful to study the effect of said inhibitor 
compounds in non-human animal model systems of cancer and is 
also useful for the treatment of cancer in a recipient in 
30 need of such treatment. Non-limiting examples of animals 
which may serve as non-human animal model systems include 
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mice, rats, rabbits, chickens, sheep, goats, cows, pi gs , and 
non-human primates. See, e.g., Desai et al . , 1999, Proc. 
AACR 40: abstract #2396; Cohen et al . , 1999, Cancer R es ., 
submitted. The compounds of the present invention may be 
administered to a transgenic non-human animal wherein said 
animal has developed cancer such as those animal models in 
which the animal has a propensity for developing cancer 
(e.g. animal model systems described in U.S. Patents 
5,777,193, 5,811,634, 5,709,844, 5,698,764, and 5,550,316). 
Such animal model systems may allow for the determination of 
toxicity and tumor reduction effectiveness of the compounds 
of the present invention. 

A preferred compound of the present invention may comprise 
high specific activity for HDLP and HDAC-related proteins, 
good bioavailability when administered orally, activity in 
reducing or ceasing cell growth in tumor cell lines, and 
activity in reducing or ceasing tumor growth in animal 
models of various cancers. 

Accordingly, another aspect of this invention is a method of 
eradicating or managing cancer in a recipient, which may be 
an animal and is preferably a human. Said method comprises 
administering to said recipient a tumor reducing amount of 
a compound as defined by formula (I) above, or a 
physiological acceptable salt thereof. 

In a further aspect of the invention, there is provided a 
composition comprising the compound of formula (I) and an 
excipient or carrier. Administration of the foregoing 
agents may be local or systemic. Such carriers include any 
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suitable physiological solutions or dispersant or the like. 
The physiologic solutions include any acceptable solution or 
dispersion media, such as saline, or buffered saline. The 
carrier may also include antibacterial and antifungal 
5 agents, isotonic and absorption delaying agents, and the 
like. Except insofar as any conventional media, carrier or 
agent is incompatible with the active ingredient, its use in 
the compositions is contemplated. 

10 Routes of administration for the compositions containing the 
delivery vehicle constructs of the present invention include 
any conventional and physiologically acceptable routes, such 
as, for example, oral, pulmonary, parenteral (intramuscular, 
intraperitoneal, intravenous (IV) or subcutaneous 

15 injection) , inhalation (via a fine powder formulation or a 
fine mist), transdermal, nasal, vaginal, rectal, or 
sublingual routes of administration and can be formulated in 
dosage forms appropriate for each route of administration. 

20 The following examples are provided to more clearly 
illustrate the aspects of the invention and are not intended 
to limit the scope of the invention. 

EXAMPLES 

25 Example 1: Protein Production and Purification : 

Full-length wild-type HDLP (Genbank accession number 
AE000719) was subcloned from an Aquifex aeolicus chromosomal 
DNA preparation (provided by Robert Huber of Universitaet of 
Regensburg, Germany) into the pGEX4T3 (Amersham- Pharmacia, 

30 Piscataway, NJ) vector using the polymerase chain reaction 
(PCR) . The cysteine-to-serine and active site mutants were 
constructed by PCR site directed mutagenesis and were 
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sequenced. The HDLP-glutathione S-transf erase (GST) fusion 
protein was produced in Escherichia coli, purified by 
affinity chromatography using a column of 
glutathione-sepharose resin (Amersham- Pharmacia, Piscataway, 
NJ) , and by anion-exchange chromatography (Q-sepharose m ; 
Amersham- Pharmacia, Piscataway, NJ) . HDLP was cleaved from 
the fusion protein with thrombin at 4° C, was purified by 
anion-exchange (Q- sepharose'"; Amersham- Pharmacia , 

Piscataway, NJ) and gel filtration chromatography 
(Superdex ,M 20 0; Amersham- Pharmacia, Piscataway, NJ) , and was 
concentrated to typically 25 mg/ml in a buffer of 25 mM 
bis-tris propane (BTP) , 500 mM NaCl , 5 mM dithiothrietiol 
(DTT) , 2% isopropanol, pH 7.0. 

Although, it is not known what metal cofactor HDLP contains 
in vivo, it is presumed to be zinc because of the 
arrangement of the ligands and the similarities in the 
active site to the zinc proteases. The lack of metal in the 
purified HDLP is presumed due, in part, to the use of DTT 
during purification. HDLP was reconstituted with Zn 2+ by 
mixing the Cys75Ser/Cys77Ser double mutant at 10 mg/ml with 
a 5-fold molar excess of ZnCl 2 in a buffer of 25 mM bis-tris 
propane, 200 mM NaCl, 1% isopropanol, pH 7.0. Unbound ZnCl 2 
was removed by fractionating HDLP through a G25 desalting 
column (Amersham- Pharmacia, Piscataway, NJ) . The 

HDLP -Zn 2+ - TSA complex was prepared by incubating the Zn 2+ 
reconstituted HDLP mutant with 1 mM TSA for 45 minutes, 
followed by gel filtration chromatography (Superdex-200; 
Amersham- Pharmacia, Piscataway, NJ) to remove, excess TSA, 
and concentration to typically 25 mg/ml in a buffer of 25 mM 
bis-tris propane, 500 mM NaCl, 1% isopropanol, pH 7.0. 
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FLAG epitope tagged human HDAC1 was overexpressed using a 
baculovirus expression system in Hi5 (Invitrogen, Carlsbad, 
CA) insect cells grown in suspension in serum-free media 
(Sf900, Gibco, Grand Island, NY). The fusion protein was 
5 purified by anion exchange and affinity chromatography using 
Ant i -FLAG M2 affinity resin (Sigma, St. Louis, MO) and FLAG 
Peptide (Sigma, , St. Louis, MO). 

Example 2: Crystallization and data collection : 
10 Crystals of apo-HDLP were grown at room temperature by the 
hanging-drop vapor-diffusion method, from 7.5% isopropanol, 
28% PEG 1500, 425 mM NaCl , 100 mM Tris-Cl, pH 7.0. They 
form in space group C2 with a = 51.4 A, b = 93.8 A, c = 78.7 
A, 3 = 96.9 A, and contain one HDLP molecule in the 
15 asymmetric unit. Diffraction data were collected with 
crystals flash-frozen in a buffer of 7.5% isopropanol, 35% 
PEG 1500, 75 mM NaCl, 100 mM Tris-Cl, pH 8.0, at -170° C. 

The structure of the HDLP- Zn 2+ complex was determined from 
20 HDLP Cys75Ser/Cys77Ser double mutant crystals grown from 23% 
tert-butanol, 27% PEG 1500, 400 mM KC1 , 100 mM bis-tris 
propane-Cl, pH 6.8. Space group and cell dimensions were 
identical to the apocrystals. The HDLP-Zn 2+ crystals were 
harvested and frozen in 27% tert-butanol, 22% PEG 1500, 50 
25 mM KC1, 20 mM NaCl, 0.2 mM ZnCl 2 , 100 mM bis-tris propane, 
pH 6.8, at -170° C. 

Crystals of the HDLP-Zn 2+ -TSA complex comprised HDLP 
Cys75Ser/Cys77Ser double mutant and were grown from 23% 
30 tert-butanol, 27% PEG 1500, 600 mM KCl , 100 mM bis-tris 
propane-Cl, pH 6.8, by microseeding . The crystals were 
grown in the presence of zinc. They form in space group 
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P2 1 2 1 2 1 with a = 53.4 A, b = 94.4 A, c = 156.3 A and contain 
two HDLP- Zn 2 + -TSA complexes in the asymmetric unit. The 
HDLP-Zn 2+ -TSA crystals were harvested and frozen in the same 
cryobuffer as the HDLP- Zn 2+ crystals except that 0 . 5mM TSA 
was added. Data were processed with DENZO and SCALEPACK 
(Otwinowski & Minor, 1997, Method. Ensemble. 276:307-326). 
MIR analysis, model building and refinement. 

The HDLP-Zn 2 + -SAHA complex crystals were grown and evaluated 
the same as the HDLP -Zn 2+ - TSA crystals. However, the 
restraints for the SAHA structure were constructed based on 
stereochemical parameters from TSA. Like the apo-HDLP 
crystals, the SAHA/HDLP co-crystals grew in space group C2 . 

Heavy-atom soaks were performed with the apo-HDLP crystals 
in a buffer of 7.5% isopropanol, 30% PEG 1500, 75 mM NaCl , 
100 mM Tris-Cl, pH 8.0, supplemented with 1 . 0 mM thimerosal 
for 2h, 5 mM KAu(CN) 2 for lh, and 1 mM Pb(Me) 3 OAc for 2h. 
MIR phases were calculated with the program MLPHARE (The 
CCP4 suite: Programs for computational crystallography, 
1994, Acta Crystallogr. D 50:760-763) at 2.5 A using the 
anomalous diffraction signal from the thimerosal derivative, 
and had a mean figure of merit of 0.55. The phases were 
improved by solvent flattening with the program DM (The CCP4 
suite: Programs for computational crystallography, 1994, 
Acta Crystallogr. D 50:760-763) , and were used to build 
the initial model with the program O (Jones et al . , 1991, 
Acta Crystallogrr. A 47:110-109). Successive rounds of 
rebuilding and simulated annealing refinement with the 
program CNS (Brunger et al . , 1998, Acta Crystallogr. D 
54:905-921) allowed interpretation of HDLP from residues 2 
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to 373. Residues 1, 374, and 375 were not modeled and are 
presumed to be disordered. 

The structure of the HDLP-Zn 2 +-TSA and HDLP-Zn 2 *-SAHA complex 
5 were determined by molecular replacement with the program 
AMORE (The CCP4 suite: Programs for computational 
crystallography, 1994, Acta Crystallogr. D 50 : 760- 763 ) using 
the apo-HDLP structure as a search model. The initial 
electron density maps had strong and continuous difference 

10 density for the entire r TSA molecule. However the SAHA 
molecule was not as well ordered in the cap group region. 
The structure of TSA was obtained from the Cambridge 
Structural Database (Refcode TRCHST) and was used to define 
stereochemical restraints used in the refinement with the 

15 program CNS. The restraints of SAHA were constructed based 
on stereochemical parameters from TSA and surrounding amino 
acid residues. The dimer interface in the HDLP-Zn 2 *-TSA and 
HDLP-Zn 2+ -SAHA crystals primarily involves Phe200 on the 
protein surface. The Phe200 side chain contacts Tyr91, 

2 0 whose side chain conformation changes on TSA binding, and 
part of the dimethyl amino phenyl group of TSA from the 
second protomer. The HDAC family does not contain a 
phenylalanine residue at the equivalent position. 

2 5 Example 3: Histone deacetylase assays : 

Purified proteins were assayed by incubating 10 //g of 
[ 3 H] acetyl -labeled murine erythroleukemia histone substrate 
and HDAC assay buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl , 
10% glycerol) for 30-60 minutes at 37° C in a total volume 

30 of 30 ill. The final concentrations of HDLP and HDAC 1 - FLAG 
were 3.6 /iM and 0.24 \M, respectively. Assays were 
performed in duplicate. The reactions were stopped and the 
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released acetate was extracted and assayed as described 
(Hendzel et al . , 1991, J. Biol. Chem. 266:21936-21942). 
[3H] acetyl -labeled murine erythroleukemia histones were 
prepared essentially as described (Carmen et al . , 1996, j. 
Biol. Chem. 271:15837-15844). Inhibitors were added in the 
absence of substrate and incubated on ice for 20 minutes, 
substrate was added, and the assay performed as described 
above. HDLP was inclubated with 20 jiM ZnCl 2 and 20 /iM 
MnCl 2 (H20) 4 in HDAC buffer and tested for activity, 

Only HDLP dialyzed against ZnCl 2 had activity. HDAC1-FLAG 
was dialyzed against 20 fiM ZnCl 2 in HDAC buffer which had no 
effect on activity. Therefore, HDAC1-FLAG contains a metal 
as purified. 

The in vivo substrate of HDLP is not known. HDLP may have 
a role in acetoin utilization like the B. subtilis AcuC gene 
product, and it has been annotated as such in the genome 
sequence, but the reaction catalyzed by AcuC is also not 
known. Furthermore, the A. aeolicus genome appears to lack 
the acuA and acuB genes that are part of the acuABC operon 
of B. subtilis (Deckert et al . , 1998 Nature 392:353-358), 
and HDLP is as similar to human HDAC1 (35.2 % identity) as 
it is to B. subtilis AcuC (34.7 % identity). 
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What is claimed is : 

1. A crystal of an enzyme comprising deacetylase activity 
wherein, said crystal effectively diffracts X-rays for 
the determination of the atomic coordinates of said 
enzyme to a resolution of greater than 4 A and wherein 
the structure of said enzyme comprises a conserved 
core a/3 structure characteristic fold wherein said 
conserved a/3 fold comprises an eight -stranded parallel 
3 sheet and eight a helices and wherein four of the 
helices pack on either face of said parallel 3 sheet 
and wherein said structure of said enzyme comprises an 
rmsd of less than or equal to 1 . 5 A in the positions of 
Ca atoms for at least 2/3 or more of the amino acids of 
HDLP as defined by the atomic coordinates of HDLP . 

2. The crystal of claim 1, wherein said protein structure 
further comprises: 

(a) eight a helices positioned near one side of the 3 
sheet ; and 

(b) at least seven large, well defined loops 
originating from the C-terminal ends of the 3~ 
strands of said eight -stranded parallel 3 sheet 
wherein the eight extra helices and the seven 
large loops are associated with a significant 
extension of the structure beyond the core a/ 3 
motif and wherein said extension of the structure 
gives rise to a deep, narrow pocket and an 
internal cavity adjacent to the pocket. 



3. The crystal of claim 1, wherein said enzyme comprising 
deacetylase activity is selected from the group 
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consisting of HDLP, HDLP-related proteins, HDACl, 
HDAC2, HDAC3 , HDAC4 , HDAC5 , HDAC6 , HDAC-related 
proteins, APAH, AcuC, and functional derivatives 
thereof . 



The crystal of claim 2 further comprising a 
specifically bound zinc atom in the active site of said 
enzyme . 

The crystal of claim 2 further comprising a 
specifically bound deacetylase inhibitor compound in 
the active site of said enzyme. 

The crystal of claim 2 define by the atomic coordinates 
according to Figure 16 . 

A method for identifying a potential deacetylase 
inhibitor compound for an enzyme which comprises 
deacetylase activity, said method comprising the steps 
of: 

a. using a three-dimensional structure of HDLP as 
defined by atomic coordinates according to Figure 
16; 

b. employing said three-dimensional structure to 
design or select said potential inhibitor; 

c. synthesizing said potential inhibitor; 

d. contacting said potential inhibitor with said 
enzyme in the presence of an acetylated substrate; 
and 

e. determining the deacetylase inhibitory activity of 
said potential inhibitor. 
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8. The method of claim 7, wherein the three-dimensional 
structure is designed or selected using computer 
modeling . 

9. The method of claim 7, wherein the potential 
deacetylase inhibitor is designed de novo. 

10. The method of claim 7, wherein the potential 
deacetylase inhibitor is designed based on a known 
inhibitor . 

11. The method of claim 7, wherein said enzyme comprising 
deacetylase activity is selected from the group 
consisting of HDLP, HDLP-related proteins, HDAC1, 
HDAC2, HDAC3, HDAC4 , HDAC5 , HDAC6 , HDAC-related 
proteins, APAH, and AcuC. 

12. A method of evaluating the binding properties of the 
potential deacetylase inhibitor compound comprising the 
steps of: 

a. co-crystallizing said compound with HDLP; 

b. determining the three-dimensional structure of 
said HDLP-potential inhibitor complex co-crystal 
by molecular replacement using the three- 
dimensional structure of HDLP as defined by atomic 
coordinates according to Figure 16; and 

c. analyzing said three-dimensional structure of said 
HDLP bound to said potential inhibitor compound to 
evaluate the binding characteristics of said 
potential inhibitor compound. 

13. A method for solving the structure of an HDAC family 
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member crystal comprising the steps of: 
a. collecting X-ray diffraction data of said crystal 
wherein said data diffracts to a high resolution 
limit of greater than 4 A; 
5 b. using the atomic coordinates of HDLP accoding to 

Figure 16 to perform molecular replacement or 
refinement and difference fourier with said X-ray 
diffraction data of said HDAC family member 
crystal to determine the structure of said HDAC 
10 family member; and 

c. refining said, structure of said HDAC family 
member. 

14. The method of claim 13, wherein said HDAC family member 
15 is HDAC1 . 

15. A Cys75Ser/Cys77Ser double mutant of HDLP wherein said 
mutant is encoded by the nucleic acid sequence of SEQ 
ID NO: 4. 



20 



25 



16. A Cys75Ser/Cys77Ser double mutant of HDLP wherein said 
mutant has the amino acid sequence of SEQ ID NO: 3. 

17. A nucleotide sequence according to SEQ ID NO: 4 

18. An expression vector comprising the nucleotide sequence 
of claim 17. 



19. A method of using the crystal of claim 1 for screening 
3 0 for a novel drug comprising: 



a. selecting a potential ligand by performing 
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rational drug design with the three-dimensional 
structure determined for the crystal; 

contacting the potential ligand with the ligand 
binding domain of the crystal ; and 

detecting the binding potential of the potential 
ligand for the ligand binding domain, wherein the 
novel drug is selected based on its having a 
greater affinity for the ligand binding domain 
than that of a known drug. 
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Figure 4 
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Figure 4d 
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Figure 6 
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Figure 1U 

10 20 30 40 

■ ■ ■ ■ I ■ i ■ ■ I ■ i ■ ' * i ' i ■ I i » i ■ 1 ■ ■ ' ■ I 1 ■ ■ ■ 1 ■ ■ ■ 1 1 
ATGAAGAAGGlTrAAACTIAICGGAACT^ 40 
ACTACAGATATCCCAAAAACCATCXTIU^ 80 
AGTITCXXTACTCCTrA«n^ 120 
ATAGATCAGAAGGAATTA^ 160 
AAGAAGAACTCCTITrAITCX^ACAQ 200 

210 220 230 240 

• ■ ' t I i ■ i i I ■ i ■ ■ I ■ i ■ ■ I ■ ■ ■ ■ I ■ i ■ ■ 1 i i i i I i i i i I 

CATITrAAT3GAAGCX3GAAAQGriGTC 240 

QGAGCHAGOGAAAAGTACAACATAQQCGjATA^ 280 

CCXJrATCITACQCXSATGTITACAQQ^ 320 

GGGITCAACAGlGCAGGCGATAGAGSAAr^ 360 

AATCTAGCTITCAATCG33CX3GGAQGTA 400 

410 . 420 430 440 

■ ■ ' ■ I ■ i ' ' I ■ i ■ ■ 1 ■ ■ ■ 1 I ■ » 1 1 1 ■ i 1 1 I 1 1 1 1 1 ■ 1 ■ ■ I 
TTAAAAGCAGGQC^AACGGCITITOCTA^ 440 
03C7IGIGG3AATIGAGTACTIGAGAAAAAAAGQ 480 
AGAATACICTACATAGAOCTIGATGa^ 520 
GriGTICAGGAAGCCriTIAeGA^ 560 
OCTCICCCTICACCAGTa^ 600 

610 620 630 640 

■ ■'■ I t ■ i ■ I i i ■ ■ I ■ ■ ■ ■ I ■ ■ i ■ I ■■ i t I ■ ■ ■ i I ■ i i i I 

GAGAAGGGCTTCCIQGAGGAGATAGGAGAAGGAAAAGGA^ 640 
AGGQCTACAACCIGAACATICCC^^ 680 

ATAGTCAAAGAAGIATTIGAGCCC^ 760 
AACTCX33AACTGAC02ACTCX?ITGA 800 

810 820 830 840 

■ ■ ' 1 1 » ■ ■ 1 1 1 ■ ■ ■ I ' ■ ■ ■ I i i i i I i 1 1 1 1 i ■ ■ i l i ■ i i I 
GOTCAACCTCIOAACXTTIG^ 840 
ATCXJTTQCTGAQGTITIXDGQGGAGGGA^ 880 
GAGGCGGATACCATCCTrACGCXXTCGCAAGGGCA 920 
CCrAATCTTOGIGOGAGCTriXDGG^ 960 
AAGCTAAACAATAAAGCAAAAGAGCTITTAAAGAGrrATAG 1000 

1010 1020 1030 1040 

' ■ ' ■ I ■ ' ' ■ I ' ' ■ ■ I i i i i I i i 1 1 I i i i 1 1 1 1 i 1 1 i i i i I 

ACTTTCAAGAGUTllACGACGAGGTOGACOQC^ 1040 

GCTCX5AAAOCXI7rAAAGGACCCCT33AGAG3AQG 1080 

AGGAAAGAAGTAAAGGATAOSCTTCAAAAQG 1120 
CATCTTA 1127 
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Figure 1 1 
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1 ■ 1 ■ I ■ ■ 1 ' I ' ■ 1 ■ * ■ 1 u ■ I ■ ■ ■ ■ 1 I ■ ' I I ■ ' ' ' I ■ ■ ■ ■ I 

MKKVKIJGIIIJYGKY^ 40 

TnFKFl .TK.crepaTKTTT J ,t .fhtt^pvttvttt. MFAEPOCT^/PK 80 

GAREKYNIQGftEsIWSY^^ 120 

NVMNPAQGMHHAFKSRAN^^ 160 

RILYIDIJDAHHCIXM?^^ 200 

210 220 230 240 

' » ' ■ I i ■ ■ ■ I ' ■ ■ ■ I ■ i i i I ■ ■ * ■ * i i ■ ■ I ■ ■ ■ ■ l ' ■ < ■ I 
EKGFluEEIGEnKGKGY^ 240 

TVKKVFRPK\m I QT fTTD PT ,T .Trn VT .QK"FMT .cqv^^-pf .prflTTM 280 

n^VFraram ryr^YHPYAT awtt .TO m- .cre^/pp 320 
KU^MCAKELIJ^IDFEEFIX)^^ 360 
RKEVKDTLEKAKASS 375 
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Figure 12 



10 20 30 40 
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ATSAAGAAGGTTAAACITATC^ 40 

ACTACAGATATOCIAAAAAa^TOCTOT 80 

AGTTlKXCTACTOCTrAQjIT^ 120 

ATAGATCAC^AQGMTTAATCAAG^ 160 

AAGAAGAACICCTITrATI\XACA033AAGA 200 

210 220 230 240 
' ' ' ■ I ' « « ■ I ■ ' » ■ I > » ■ ■ I » i ■ i I » ■ ■ ■ I ■ ■ ■ ■ I ■ ■ ■ . | 

CACTITAAT33AAG2QGAAA^ 240 

GGAGCTAGGGAAAACTACAACATA^ 280 

CtCTATCITACX30GATCTITAC^ 320 

03^ITCAACAGTOZAG333ATAGAQ3A^ 360 

AATCTAGCTTTCAATCC03CX3C93AQ^^ 400 

410 420 430 440 
1 1 1 1 I ii'il i i i i I i i t i I i i i i I i i i i I .. i.i .... I 

TTAAAAQCAGQGCAAACGGCITITQ^ 440 

CGCIUiaSGAATTCACTACITCA^ 480 

AGAATACTCTACATAGAX^ 520 

GTOITCAQGAAiXCTITrAC^ 560 

CCTTTOOCTICACCAl^^ 600 

610 620 630 640 
1 1 1 ■ I ■ ■ 1 1 I ■ 1 ■ 1 1 ■ 1 i i 1 ■ ■ ■ 1 1 ■ ■ ■ ■ I ■ ■ ■ ■ I ■ • ■ > I 

GAGAAQ33CTTOTIQ2AGGA^ 640 

AG03CTACAAOITCAACAITOCCCT 680 

CaACAAOGAGITCrTCTTK3QCCT 720 

ATAGIOW^GAACTATITCAGXCQAGCT^ 760 

AACI03GAACTCA0CX2O^ 800 

810 820 830 840 

' 1 1 1 1 1 1 1 1 * 1 ' 1 ' 1 1 ' 1 1 1 I t I I » I I | | I ■ ■ . . | | | | ! | 

ATQCTITOTrcAGGnTro 880 

GAGGCGSATTCCAIXXTIACX3CCCI^^ 920 

CCTAfiTCR3CTXX3AGC^^ 960 

AAQCTAAACAATAAAGCAAAAGAGCITIT 1000 

1010 1020 1030 1040 

1 1 1 1 1 1 1 ■ 1 I " M I I I ■■ I ....I.,.. I .... i | 

A^TTCAAGAGTTIGAQGACG^ 1040 

QCTOSAAAOGCTAAAQQAOCXXna^ 1080 

AQGAAAGAACTAA^^ 1120 
CATCTTA 1127 
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Figure 13 

10 20 30 40 
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MKKVKLIGHIXJYC^ 40 

IDEKELIKSRPATKEEII^^ 80 

GAREKYOTQGYENPVSYSMFTGSS 120 

NVAFNPAGGMHHMT^^ 160 

RILYIIM5AHHC1X3VQE^ 200 

210 220 230 240 

■ ■ ■ ■ I ■ ■ ■ > I ■ t ■ ■ ' i i ■ ■ I ■ ■ ■ ■ I ' ■ ■ ' I ' ■ ■ ' I ' ' ' ' 1 

EKGFLEEIGBGKGKGYN^ 240 

IVKEVFE3?E\A?LIJ3lJG^ 280 

T\jPiEJFGE&JYlJ3S33F^^ 320 

KUSKKAKELiiKSIDFEEJT^ 360 
RKEVKDTLEKAKASS 375 
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Figure 14 
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ATCAAGAAGGTTAAACTIM^ 40 

ACTACAGOTATHXAAAAAO^^ 80 

^TIOOC^ACTOriTASGTITIT 120 

ATACATCAGAAQC^TTAAT^^ 160 

AAC^AGAACTCCTm 2 00 

210 220 230 240 

J ' ' ' ' 1 1 ' 1 1 1 ' 1 ' 1 1 1 I I 1 1 I I I I I I U I I 1 I I ■ | f | , , ! | 

CACnTTAATQSAAQCQ^^ 240 

03AGCTAGC3GAAAACTACAACATAQ303C^TACGAAAArc 280 
OCXTTATCITADGOSATCTI^ 320 

360 

AASUTAG3TICAATCXm333^^ 400 
410 420 430 440 

1 1 " 1 1 1 ' ' I ' ' ' ' I ■ ' ' ' I ' ■ ' ' I I I ■ | I | f , . I . . ■ , \ 

CTAAAAGCAQ3QC2^093CT^ 440 
03C7ITjTT333AATTCAGTACTTCA^ 480 
AGAATACTCIACATAGAOC^TCATO^^ 520 

CCK3I,a:CTI ^^ 600 

610 620 630 640 

1 " ' 1 ' ' 1 ' I ' " ' ' ' ' ' ' I ' i i i » i i i i I i i , i | , i , i | 

GAGAAQ33OT ^^^ 640 

aoaSCTACAAOCTCAACAIT^^ 680 

°GACAA03AGITCXT^^ 720 

ATAGICAAAGAAGTATTTC^^ 76 o 

AACrCQ3AACI ^^ 800 

810 820 830 840 

" 1 1 1 ' 1 1 ' I ■ ' ' ' ' ' ■ ■ i ' ■ » ■ i ' i i i i I i i i t I i i i i I 

^TC^acc 1 ^^ 840 

^^C™ 13 ^^ 880 

GAQQCX3G&TAOIATCCrrA^ 920 

OCraAICTOGTCOGA^ 960 

AAQCTAAACAATA^AGCAAAAG^ 1000 

1010 1020 1030 1040 

- " 1 1 1 ' 1 1 ' 1 1 1 ■' 1 ■■■■ 1 I I .... I 

ACT1TCAftG ^^ 1040 
^-^^AACOCTrAAA 108 0 

AOGAAAGAACTAAAGGATACGC^TCAAAAGGO^AAAa 1320 
CATCTTA 1127 



WO 01/18045 



PCT/US00/24700 



18/263 



Figure 15 
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MKKVKLIGHIJDYGKYRYPKN^ 40 

GAREXYl^nGGYENPVS^ U Q> 

RILYIDLDAHHCDGVQEAI^^ 200 

210; 220 230 240 

" " 1 1 1 1 ' 1 11 11 1 1 i ■■»■■■■ I | ... I ' MM t 

EKGFIJE£IGEGKGK(^ 240 

IVKEA/FEPEVYLDQI/jTOPL]^ 280 

^^^^SSVYI^ 32 0 

KLTOKftKELIJSinFEEra2E^ 360 
RKEVKDTLEKaKASS 375 
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32.727 63.371 1.00 18.14 AAAA 
31.913 62.765 1.00 16.50 AAAA 
32.730 61.556 1.00 16.48 AAAA 
21.863 60.312 1.00 25.58 AAAA 

31.798 59.871 1.00 26.40 AAAA 

32.728 58.701 1.00 33.18 AAAA 
32.487 57.546 1.00 21.05 AAAA 

33.799 58.992 1.00 22.85 AAAA 
33.816 61.832 1.00 16.12 AAAA 
33.577 62.364 1.00 13.97 AAAA 
35.032 61.468 1.00 14.42 AAAA 
36.215 61.763 1.00 17.99 AAAA 
37.070 62.719- 1.00 20.68 AAAA 
36.283 63.964 1.00 22.90 AAAA 
36.906 64.635 1.00 29.36 AAAA 
36.214 64.900 1.00 14.80 AAAA 
37.060 60.594 1.00 19.32 AAAA 
38.260 60.561 1.00 21.23 AAAA 
36.467 59.639 1.00 17.21 AAAA 
37.259 58.516 1.00 15.64 AAAA 
38.348 59.028 1.00 17.31 AAAA 
38.150 60.027 1.00 15.62 AAAA 
39.494 58.348 1.00 19.72 AAAA 
40.598 58.776 1.00 19.88 AAAA 
41.973 58.589 1.00 14.02 AAAA 
42.187 57.206 1.00 17.58 AAAA 
42.009 59.392 1.00 19.67 AAAA 
40.634 58.053 1.00 20.58 AAAA 
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Figure 17-3 
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Figure 17-4 
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Figure 17-5 
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Figure 17-6 
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Figure 17-7 
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Figure 17-8 
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32 317 29.573 46.111 1.00 23.09 7 

33 129 28.632 45.335 1.00 23.05 , 6 

34 063 29.365 44.375 1.00 21.60 6 
33 377 30.379 43.487 1.00 24.09 6 
~2 °69 31.609 43.999 1.00 23,29 6 
3-> 365 32.555 43.199 1.00 23.26 6 
33 154 30.117 42.135 1.00 22.52 6 
32.544 31.061 41.317 1.00 24.82 6 
32.153 32.281 41.857 1.00 27.55 6 
il 553 33.241 41.064 1.00 32.35 8 
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Figure 17-19 
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Figure 17-20 
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18.828 38.954 48.141 1.00 34.41 6 
17.665 38.131 52.219 1.00 37.41 6 
18 125 37.000 52.370 1.00 37.96 8 
16 804 38.694 53.066 1.00 36.21 7 

16 294 38.013 54.253 1.00 36.46 6 
17.263 38.136 55.427 1.00 37.22 6 

17 190 39.440 55.991 1.00 37.41 8 
14 997 38.705 54.653 1.00 35.55 6 
14 889 39.927 54.568 1.00 37.09 8 
14.018 37.928 55.093 1.00 34.33 7 
12 750 38.493 55.532 1.00 36.13 6 
11 596 37.548 55.183 1.00 36.11 6 
11 503 37.222 53.705- 1.00 36.79 6 
11.453 38.487 52.869 1.00 38.55 6 

11 369 38.170 51.389 1.00 41.60 6 
11.503 39.413 50.569 1.00 42.07 7 - 

12 791 38.738 57.043 1.00 34.90 6 
11.758 38.867 57.694 1.00 37.18 8 
13.998 38.775 57.595 1.00 32.82 7 

14 192 39.021 59.016 1.00 31.50 6 

15 477 38.337 59.495 1.00 34.05 6 
15.379 36.839 59.604 1.00 34.54 6 

16 506 36.087 59.940 1.00 35.04 6 

14 161 36.178 59.429 1.00 34.57 6 
16 423 34.691 60.108 1.00 35.44 6 
14.066 34.784 59.594 1.00 36.00 6 

15 201 34.040 59.936 1.00 34.68 6 
14 319 40.530 59.190 1.00 30.94 6 
14.983 41.192 58.394 1.00 30.27 8 
13.693 41.081 60.222 1.00 32.53 7 
13.760 42.527 60.448 1.00 35.83 6 
12.344 43.115 60.570 1.00 37.23 6 
11.478 42.809 59.360 1-00 40.75 6 
11.830 43.148 58.227 1.00 43.88 8 
10.335 42.165 59.594 1.00 39.61 7 
14 553 42.854 61.710 1.00 35.45 6 
14 095 43.621 62.560 1.00 41.47 8 
15.747 42.285 61.827 1.00 33.27 7 

16 *71 42.510 63.004 1.00 30.68 6 
17.638 41.431 63.114 1.00 27.41 6 

17 140 40.002 62.988 1.00 23.76 6 
18.222 39.106 63.543 1.00 27.14 6 
15.855 39.801 63.772 1.00 28.00 6 
17 258 43.856 63.Q33 1.00 32.30 6 
17.347 44.554 62.017 1.00 36.27 8 

17 749 44.207 64.216 1.00 30.33 7 

18 465 45.457 64.424 1.00 30.79 6 
17 816 46.249 65.562 1.00 29.53 6 
17.712 45.471 66.739 1.00 30.43 8 

19 911 45.109 64.768 1.00 33.09 6 

20 194 43.972 65.172 1.00 29.64 8 
20.821 46.069 64.586 1.00 32.36 7 
22.234 45.846 64.896 1.00 31.65 6 
23.036 47.141 64.771 1.00 33.76 6 
23 101 47.658 63.361 1.00 37.76 6 
23 719 48.686 63.100 1.00 36.12 8 
22 460 46.952 62.437 1.00 44.79 7 
22 369 45.333 66.321 1.00 32.61 6 
22 970 44.283 66.565 1.00 27.95 8 
21.803 46.091 67.257 1.00 33.22 7 

21 839 45.741 68.668 1.00 35.52 6 
20.928 46.660 69.481 1.00 37.80 6 
•50.987 46.276 70.964 1.00 39.00 6 
21.356 48.112 69/275 1.00 38.88 6 
21.416 44.300 68.908 1.00 34.26 6 
22.060 43.580 69.679 1.00 35.96 8 
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26.654 22.785 72.005 1.00 36.45 6 
27.084 32.871 70.524 1.00 35.62 6. 
27.829 31.604 70.126 1.00 31.20 6 
25.880 33.080 69.646 1.00 34.51 6 
27.919 32.723 72.857 1.00 37.80 6 
28.182 31.722 73.532 1.00 36.12 8 
28.693 33.808 72.821 1.00 38.45 # 
29.929 33.884 73.567 1.00 40.06 6 
30.551 35.272 73.449 1.00 39.38 6 
30.974 25.625 72.027 1.00 41.90 6 
31.492 27.048 71.968 1.00 41.36 6 
32.647 27.206 72.840 1.00 43.35 7 
23.162 38.373 73.215 1.00 42.55 6 
32.628 39.516 72.797 i.00 29.95 7 
34.220 38.392 74.014 1.00 41.72 7 
29 614 33.587 75.044 1.00 40.01 6 
30.350 22.862 75.716 1.00 39.01 8 
28.506 34.141 75.520 1.00 40.30 7 
28.084 33.923 76.894 1.00 43.19 6 
26.753 24.647 77.165 1.00 47.53 6 
26.875 36.176 77.090 1.00 56.10 6 
25.542 36.923 77.179 1. 00 60.77 6 
24.659 36.682 76.329 1.00 61.41 8 
25.383 37.763 78.096 1.00 62.21 8 

27 953 32.429 77.179 1.00 40.72 6 
28.565 31.922 78.120 1.00 45.29 8 
27.186 21.721 76.354 1.00 34.82 7 
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ATOM 27 07 OD1 AS? A 339 

ATOM 2708 OD2 ASP A 339 

ATOM 2709 C ASP A 3 39 

ATOM 2710 0 ASP A 33 9 

ATOM 2711 N ASP A 340 

ATOM 2712 CA ASP A 340 

ATOM 2713 CB ASP A 340 

ATOM 2714 CG ASP A 3 40 

ATOM 2715 OD1 ASP A 340 

ATOM 2716 OD2 ASP A 3 40 

ATOM 2717 C ASP A 340 

ATOM 2718 O ASP A 340 

ATOM 2719 N GLU A 341 

ATOM 2720 CA GLU A 3 41 

ATOM 2721 CB GLU A 341 

ATOM 2722 CG GLU A 341 

ATOM 2723 CD GLU A 341 

ATOM 2724 OE1 GLU A 341 

ATOM 2725 0£2 GLU A 3 41 

ATOM 2726 C GLU A 341 

ATOM 2727 O GLU A 341 

ATOM 2728 N VAL A 342 

ATOM 2729 CA VAL A 342 

ATOM 2730 CB VAL A 342 

ATOM 2731 CGI VAL A 342 

ATOM 2732 CG2 VAL A 3 42 

ATOM 2733 C VAL A 342 

ATOM 2734 O VAL A 342 

ATOM 2735 N ASP A 34 3 

ATOM- 2736 CA ASP A 343 

ATOM 2737 CB ASP A 343 

ATOM 2738 CG ASP A 343 

ATOM 2739 ODl ASP A 343 

ATOM 2740 OD2 ASP A 343 

ATOM 2741 C ASP A 343 

ATOM 2742 O ASP A 343 

ATOM 2743 N ARG A 344 

ATOM 2744 CA ARG A 344 

ATOM 274 5 CB ARG A 3 44 

ATOM 2746 CG ARG A 344 

ATOM 2747 CD ARG A 3 44 

ATOM 2748 NE ARG A 344 

ATOM 2749 CZ ARG A 344 

ATOM 2750 NH1 ARG A 344 

ATOM 2751 NH2 ARG A 3 44 

ATC I 2752 C ARG A 3 44 

ATCM 2753 O ARG A 3 44 

ATOri 2754 N SER A 3 45 

ATOM 2755 CA SER A 345 

ATOM 2756 CB SER A 345 

ATOM 2757 CG SER A 345 

ATOM 2758 C SER A 345 

ATOM 2759 O SER A 345 

ATOM 2760 N TYR A 346 

ATOM 2761 CA TYR A 346 

ATOM 27 62 CB TYR A 346 

ATOM 2763 CG TYR A 346 

ATOM 27 64 GDI TYR A 346 

ATOM 2765 CE1 TYR A 346 

ATOM 2766 CD2 TYR A 346 

ATCM 2767 CE2 TYR A 346 

A^C V 2768 CZ TYR A 346 

\tC*m 2769 OH TYR A 346 
ATCM 277C C TYR A 346 
*ATC M 2771 O TYR A 34 6 
ATCM 2772 N MET A 347 



11 548 51.893 48.320 1.00 72.29 8 
10 262 50.188 48.858 1.00 71.03 8 

13 715 51.183 46.853 1.00 73.18 6 

14 407 51.884 47.592 1-00 73.78 8 
13 247 51.600 45.677 1.00 76.36 7 
13 518 52.943 45.152 1.00 78.34 6 

12 410 53.385 44.189 1.00 77.55 6 
12 462 52.655 42.864 1.00 78.90 6 

12 348 51.408 42.855 1.00 78.38 8 
12.620 53.336 41.830 1.00 78.74 8 

13 687 54.017 46.214 1.00 79.51 6 
14.587 54.856 46.117 1.00 80.19 8 
12.824 54.000 47.224 1.00 79.64 7 
12.922 54.998 48.271 1.00 80.05 6 
12.269 56.301 47.811 1.00 83.75 6 
12.411 57.442 48.806 1.00 89.02 6 
11.756 58.724 48.328 1.00 91.52 6 
10.515 58.738 48.175 1.00 93.33 8 

12 484 59.716 48.102 1.00 92.65 8 
12.317 54.578 49.597 1.00 77.98 6 
11.102 54.610 49.777 1.00 79.82 8 

13 179 54.181 50.523 1.00 74.49 7 
12.745 53.793 51.859 1.00 71.55 6 
13.224 52.383 52.245 1.00 72.40 6 
12 672 52.004 53.610 1.00 71.16 6 
12.797 51.391 51.207 1.00 74.35 6 
13.454 54.778 52.766 1.00 63.46 6 
12 952 55.154 53.829 1.00 68.96 3 
14.636 55.184 52.311 1.00 61.61 7 
15 486 56.114 53.029 1.00 54.91 6 
14.678 57.303 53.543 1.00 55.06 6 
15.556 58.390 54.114 1.00 54.44 6 
15.002 59.351 54.694 1.00 56.20 8 
16.795 58.287 53.969 1.00 49.32 8 
16.152 55.401 54.198 1.00 50.85 6 

15 557 55.209 55.257 1.00 49.32 8 
17.396 55.004 53.980 1.00 47.84 7 
18.195 54.321 54.981 1.00 45.34 6 
18.883 53.099 54.358 1.00 45.00 6 
17 950 51.969 53.974 1.00 38.03 6 
17.185 51.531 55.188 1.00 35.83 6 

16 278 50.439 54.885 1.00 39.20 7 
15.350 49.993 55.724 1.00 39.89 6 
15 217 50.561 56.917 1.00 40.17 7 
14.566 48.976 55.375 1.00 40.75 7 
19.250 55.278 55.515 1.00 44.72 6 
20 170 54.869 56.223 1.00 46.97 8 
19.113 56.552 55.157 1.00 45.81 7 
20.045 57.596 55.577 1.00 43.66 6 
19.538 58.960 55.115 1.00 43.44 6 
18.292 59.260 55.722 1.00 45.62 8 
20.258 57.627 57.089 1.00 42.79 6 
21.364 57.902 57.552 1.00 42.62 8 
19.200 57.354 57.851 1.00 40.55 7 
19.280 57.352 59.308 1.00 41.05 6 
17.971 56.819 59.905 1.00 41.74* 6 
17.668 55.355 59.630 1.00 43.47 6 
18.331 54.333 60.328 1.00 44.45 6 
18.044 52.983 60.088 1.00 41.02 6 

16 710 54.988 58.682 1.00 42.08 6 
16.416 53.644 58.434 1.00 40.59 6 

17 086 52.649 59.139 1.00 41.66 5 
16 806 51.324 58.884 1.00 39.60 8 
20.466 56.517 59.796 1.00 42.92 6 
21.101 56.844 60.799 1.00 42.65 8 
20.757 55.443 59.067 1.00 44.59 7 
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Figure 18-1 
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Figure 18-3 
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Figure 18-4 
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Figure 18-5 
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Figure 18-16 
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Figure 18-34 
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Figure 18-35 



ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
•ATOM 
ATOM 
. ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATCM 
ATOM 
ATCM 
ATOM 
ATOM 
ATOM 
ATOM 
ATCM 
ATCM 
ATCM 
ATCM 
ATOM 
ATOM 
ATCM 
ATCM 



2245 

2246 

2247 

2248 

2249 

2250 

2251 

2252 

2253 

2254 

2255 

2256 

2257 

2258 

2259 

2260 

2261 

2262 

2263 

2264 

2265 

2266 

2267 

2268 

2269 

2270 

2271 

2272 

2273 

2274 

2275 

2276 

2277 

2278 

2279 

2280 

2281 

2282 

2283 

2284 

2285 

2286 

2287 

2288 

2289 

2290 

2291 

2292 

2293 

2294 

2295 

2296 

2297 

2298 

2299 

2300 

2301 

2302 

2303 

2204 

2305 

2306 

2307 

2308 

2309 

2310 



CA VAL A 282 
CB .VAL A 282 
CGI VAL A 282 
CG2 VAL A 282 
C VAL A 
O VAL A 
N ARG A 
CA ARG A 283 
CB ARG -A 283 
CG- ARG A 283 
CD ARG A 2 S3 
NE ARG. A 
CZ ARG 
NH1 ARG A 
NH2 ARG A 283 
C ARG A 283 
ARG A 283 



282 
:82 
>83 



233 
283 
?83 



O 
N 

CA 
CB 
CG 
CD 



GLU A 284 
GLU A 284 



284 
284 
284 



GLU 
GLU 
GLU 

OE1 GLU A 284 
OE2 GLU A 284 
GLU 
GLU 
VAL 
VAL 
VAL 



C 
O 
N 

CA 
CB 



CGI VAL 
CG2 VAL 
C VAL 



O 
N 

CA 
C3 
CG 



VAL 
?HE 
PHE 
PHE 
PHE 



CD1 PHE A 
CD2 PHE A 
CE1 PHE A 



284 
284 
285 
285 
285 
285 
285 
285 
285 
286 
286 
286 
286 
286 
286 
286 



CE 
CZ 
C 

o 

N 

CA 
C 
0 
N 

CA 
CB 
CG 
CD 



2 PHE A 286 
286 
286 
286 
287 
287 
287 
287 
288 
288 
288 
288 
288 
288 



PHE 
FHE 
GLY 
GLY 
GLY 
GLY 
GLU 
GLU 
GLU 
GLU 
GLU 



OE1 GLU A 
OE2 GLU A 288 

C 
0 
N 

CA 



0 

:i 

CA 
CB 
CGI 



GLU 
GLU 
GLY 
GLY 
GLY 
GLY 
VAL 
VAL 
VAL 
VAL 



CG2 VAL A 



288 
288 
289 
289 
289 
289 
290 
290 
290 
290 
290 



48. 232 

48.075 

46.641 

48.313 

47.952 

46.884 

48.837 

48.587 

49 .629 

49.551 

50.729 

50.730 

51.826 

53.012 

51.742 

48.561 

47.830 

49.250 

49.376 

50.499 

51.917 

52.989 

53.012 

53 .810 

48.039 

47.525 

47.472 

46 .205 

46.039 

44.654 

47.114 

44.964 

44.043 

44.931 

43 .760 

43 .159 

42.544 

43.104 

41.398 

42.527 

40.813 

41.381 

43 .922 

42.984 

45.086 

45.297 

45.525 

45.914 

45.288 

45.464 

45.613 

46.910 

48.140 

48.144 

49.106 

44.309 

43.144 

44.641 

43.625 

43.922 

44.618 

43.384 

43.612 

42.288 

42.554 

41.308 



22.214 123.081 
20.797 122.523 
20.358 122.841 
20.781 121.018 
22.236 124.558 
22.715 124.917 
21.720 125.406 
21.675 126.840 
20.785 127.519 
19.334 127.061 
18.539 127.554 
18.314 128.990 
18.351 129.742 
18.611 129.198 
18.100 131.035 
23.065 127.473 
23.302 128.439 
23.985 126.928 
25.348 127.448 
26.166 126.799 
25.702 127.141 
26.495 126.401 
27.738 126.542 
25.880 125.680 1 
26.014 127.148 1 
26.783 127.954 1 



PCT7US00/24700 



1.00 30.76 
1.00 35.10 
1.00 28.72 
1.00 28.66 
1.00 31.39 
1.00 32.70 
1.0C 29.86 
1.00 34.82 
1.00 31.44 
1.00 29.49 
1.00 30.67 
1.00 30.78 
1.00 35.27 
1.00 36.46 
1.00 35.90 



1.00 36.06 
1.00 35.04 
1.00 35.70 
1.00 40.93 
1.00 44.17 
.00 56.39 
.00 60.69 
.00 63.13 



1 

1. 
1 



25.704 125.986 
26.294 125.592 
26.291 124.062 
26,811 123.693 
27.153 123.419 
25.638 126.192 
26.336 126.611 
24.308 126.236 
23.608 126.753 
22.723 125.657 
23.490 124.529 
23.459 123.256 
24.245 124.736 
24.170 122.202 
24.958 123.693 
24.919 122.419 
22.773 128.015 
22.080 128.409 
22.840 128.656 
22.056 129.862 
20.590 129.527 
20.264 128.403 
19.710 130.500 
18.273 130.310 
17.576 131.663 
17.864 132.411 
17.455 131:622 
16.334 131.069 
18.245 131.571 
17.623 129.546 
17.957 129.762 
16.694 128.657 
15.999 127.886 
14.510 127.861 
14.007 128.739 
13.307 126.868 
12.373 126.718 
11.626 126.412 
10.148 126.204 
11.822 127.565 



00 62.79 
00 39.24 
00 38.52 
00 33 .75 
00 35.82 
00 34.14 
00 36.43 
00 37.26 
00 38.96 
00 41.83 
00 37.57 
i.00 35.05 
1.00 32.53 
1.00 30.15 
1.00 33.96 
1.00 30.30 
1.00 32.96 
1.00 31.67 
1.00 31.66 



00 35.18 
00 36.97 
00 30.87 
00 30.06 
34.44 
00 32.54 
00 28.42 
00 32.23 
00 37.02 
00 45.36 
00 45.65 
1.00 46.42 
1.00 50.78 
1.00 30.98 
1.00 28.67 
1.00 29.66 
00 28.08 



1 
1 
1 
1 

1.00 

1 

1 

1 

1 

1 
1 



00 30.45 

00 25.90 

00 26.21 

00 27.31 

00 26.53 

1.00 25.36 

i.00 24.38 



WO 01/18045 



152/263 

Figure 18-36 
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64.333 23.471 116.569 1.00 53. CI 
63.439 24.463 116.047 1.00 54.23 
62.234 24.368 116.783 1.00 52.59 
63.538 25.422 115.028 1.00 52.82 
63.655 22.836 117.575 1.00 51.80 
62.393 23.368 117.708 1.00 51.38 
61.134 25.197 116.532 1.00 51.28 
62.444 26.245 114.779 1.00 53.22 
61.257 26.126 115.531 1.00 50.50 
65.935 21.836 113.954 1.00 37.34 
67.041 21.929 113.422 1.00 39.01 
64.809 21.764 113.259 1.00 36.31 
64.797 21.820 111.802 1.00 35.22 
64.317 20.469 111.252 1.00 33.36 
65.310 19.340 111.564 1.00 34.50 
64.729 17.927 111.468 1.00 28.06 

65.745 16.956 111.870 1.00 24.79 
65.499 15.703 112.236 1.00 27.56 
64.253 15.237 112.259 1.00 19.09 
66.502 14.918 112.604 1.00 21.2.4 
63.874 22.955 111.390 1.00 33.74 

62.746 22.732 110.950 1.00 34.68 
64 361 24.181 111.550 1.00 35.60 
63 556 25.345 111.220 1.00 35.43 
63 719 25.932 109.830 1.00 38.08 
64.112 25.250 108.885 1.00 37.29 
63.406 27.218 109.721 1.00 39-67 
63.493 27.925 108.457 1.00 36.36 
62.398 28.966 108.499 1.00 39.45 
61.763 29.131 109.539 1.00 37.58 
62.163 29.662 107.391 1.00 40.89 
61.121 30.682 107.358 1.00 41.37 
61.310 31.627 106.172 1.00 44.64 
60.956 30. "977 104.848 1.00 52.13 
60 833 31.973 103.708 1.00 59.14 
60.448 31.551 102.593 1.00 60.47 
61.119 33.173 103.923 1.00 57.77 
59.770 30.006 107.200 1.00 38.02 
59 689 28.828 106.850 1.00 35.29 
58 708 30.762 107.441 1.00 36.81 
57.363 30.237 107.291 1.00 35.97 
56 401 30.789 108.368 1.00 34.90 
54 999 30.251 108.133 1.00 36.53 
56 888 30.393 109.755 1.00 37.06 
56.886 30.690 105.928 1.00 36.74 
56.661 31.881 105.712 1.00 34.90 

56 753 29.741 105.004 1.00 35.48 
56.301 30.049 103.652 1.00 38.21 
56.152 28.776 102.815 1.00 39.76 

57 416 28.342 102.098 1.00 39.93 

57 225 26.963 101.486* 1.00 38.68 
57.112 25.940 102.525 1.00 39.72 
56 952 24.643 102.286 1.00 38.79 
56.881 24.200 101.036 1.00 32.40 
56 899 23.785 103.297 1.00 36.58 
54 996 30.807 103.603 1.00 38.98 
54 120 30.636 104.452 1.00 39.07 
54.880 31.634 102.573 1.00 39.95 
53 709 32.459 102.339 1.00 42.73 

53 931 33.301 101.078 1.00 44.92 

54 995 34.390 101.219 1.00 55..4S 
56.351 33.842 101.671 1.00 58.28 
56 907 32.809 100.697 1.00 57 . 04 

58 224 32.283 101.151 1.00 58.75 
52.434 31.634 102.200 1.00 40.51 
51.391 31.996 102.748 1-00 36.10 
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51.458 15.533 67.205 1.00 22-45 
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50.395 18.601 69.930 1.00 34.22 
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Figure 18-61 
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Figure 19-2 
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Figure 19-3 
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39.185 52.255 60.881 1.00 34.19 AAAA 
39.554 52.775 61.929 1.00 34.32 AAAA 
39.853 52.384 59.745 1.00 33.99 AAAA 
41.034 53.216 59.680 1.00 35.17 AAAA 
41.812 52.943 58.388 1.00 37.40 AAAA 
42.964 53.908 58.186 1.00 38.64 AAAA 
43.648 54.218 59.184 1.00 40.02 AAAA 
43.201 54.341 57.035 1.00 38.74 AAAA 
40.568 54.670 59.724 1.00 35.85 AAAA 

41.231 55.527 60.306 1.00 36.38 AAAA 
39.420 54.940 59.111 1.00 34.96 AAAA 
38.851 56.280 59.108 1.00 34.47 AAAA 
37.751 56.373 58.067 1.00 33.80 AAAA 
38.291 56.617 60.499 1.00 34.66 AAAA 
38.268 57.779 60.899 1.00 34.55 AAAA 
37.830 55.600 61.226 1.00 34.24 AAAA 
37.287 55.794 62.572 1.00 33.07 AAAA 
36.289 54.687 62.917 1.00 32.82 AAAA 
35.084 54.559 61.996 1.00 32.72 AAAA 
33.980 55.948 62.101 1.00 33.65 AAAA 
33.550 55.878 63.849 1.00 33.77 AAAA 
38.430 55.724 63.583 1.00 33.12 AAAA 
38.226 55.930 64.777 1.00 32.82. AAAA 
39.628 55.428 63.090 1.00 32.64 AAAA 
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40.677 58.838 63.772 1.00 33.52 AAAA 
40.483 54.212 65.009 1.00 31.69 AAAA 
40.565 54.490 66.205 1.00 31.12 AAAA 
40.095 53.010 64.570" 1.00 31.76 AAAA 
39.750 51.898 65,474 1.00 32.48 AAAA 
38.259 51.559 65.386 1.00 32.55 AAAA 
37.231 52.581 65.879 1.00 32.84 AAAA 
35.837 52.089 65.554 1.00 33.79 AAAA 
37.372 52.798 67.376 1.00 32.45 AAAA 
40.555 50.628 65.187 1.00 32.92 AAAA 
40.196 49.530 65.618 1.00 31.64 AAAA 
41.652 50.794 64.464 1.00 34.12 AAAA 
42.508 49.680 64.116 1.00 36.07 AAAA 
42.017 43.991 62.811 1.00 35.51" AAAA 
42.070 49.952 61.635 I. 00 33.37 AAAA 
42.898 47.790 62.480 1.00 35.97 AAAA 
42.854 46.701 63.500 1.00 37.19 AAAA 
43.921 50.226 63.916 1.00 38.85 AAAA 
44.106 51.346 63.413 1.00 38.98 AAAA 
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